T-Sim fault tolerance
Автор: Tyutlyaeva Ekaterina Olegovna, Moskovskii Aleksandr Aleksandrovich
Журнал: Программные системы: теория и приложения @programmnye-sistemy
Статья в выпуске: 3 (7) т.2, 2011 года.
Бесплатный доступ
This paper addresses fault-tolerance challenges in distributed computing environment. Increasing scalability of modern computational clusters leads to an increasing probability of an interrupt occuring. In a number of cases computational algorithms, such as genetic algorithms, Monte Carlo based algorithms, have the mathematical properties that they get the correct answer despite the occurrence of faults in the system. This paper proposes methods for implementation such class of algorithms despite software and hardware faults. Some example of monotonous reducing object is implemented using C++ template class library T-Sim. Moreover, some test realizations are implemented.
Local synchronization, monotonous object, t-sim c++ template library, fault-tolerance
Короткий адрес: https://sciup.org/14335909
IDR: 14335909