T-Sim fault tolerance

Автор: Tyutlyaeva Ekaterina Olegovna, Moskovskii Aleksandr Aleksandrovich

Журнал: Программные системы: теория и приложения @programmnye-sistemy

Статья в выпуске: 3 (7) т.2, 2011 года.

Бесплатный доступ

This paper addresses fault-tolerance challenges in distributed computing environment. Increasing scalability of modern computational clusters leads to an increasing probability of an interrupt occuring. In a number of cases computational algorithms, such as genetic algorithms, Monte Carlo based algorithms, have the mathematical properties that they get the correct answer despite the occurrence of faults in the system. This paper proposes methods for implementation such class of algorithms despite software and hardware faults. Some example of monotonous reducing object is implemented using C++ template class library T-Sim. Moreover, some test realizations are implemented.

Еще

Local synchronization, monotonous object, t-sim c++ template library, fault-tolerance

Короткий адрес: https://sciup.org/14335909

IDR: 14335909

Статья научная