Simulation of multicore vector dataflow processor with shared memory
Автор: Dikarev Nikolay Ivanovich, Shabanov Boris Mikhaylovich, Shmelev Aleksandr Sergeyevich
Журнал: Программные системы: теория и приложения @programmnye-sistemy
Рубрика: Программное и аппаратное обеспечение для супер ЭВМ
Статья в выпуске: 1 (36) т.9, 2018 года.
Бесплатный доступ
A Dataflow processor can execute up to 16 instructions per cycle compared to 4 to 6 instructions of the best von Neumann processors. Simulation of the vector dataflow processor (VDP) showed that it is possible to raise its core vector performance up to 256 flops per clock, and using modern manufacturing process to implement up to 4 such cores on a single die. Simulation results of the matrix multiplication program and 2D Stencil on double core VDP with shared memory are given in this paper. It is shown that the matrix multiplication program scales well on VDP, while the performance of 2D Stencil is limited by the shared memory bandwidth. (In Russian). (in Russian).
2d stancil, 2d stencil, dataflow architecture, matrix multiplication, performance evaluation, supercomputer, vector processor
Короткий адрес: https://sciup.org/143164297
IDR: 143164297 | DOI: 10.25209/2079-3316-2017-9-1-37-52