A1 Journal article – refereed

Performance and programmability comparison of the thick control flow architecture and current multicore processors

List of Authors: Forsell Martti, Nikula Sara, Roivainen Jussi, Leppänen Ville, Träff Jesper Larsson

Publisher: SPRINGER

Publication year: 2021

Journal: Journal of Supercomputing

Journal name in source: JOURNAL OF SUPERCOMPUTING

Journal acronym: J SUPERCOMPUT

Number of pages: 32

ISSN: 0920-8542

eISSN: 1573-0484

DOI: http://dx.doi.org/10.1007/s11227-021-03985-0

Commercial multicore central processing units (CPU) integrate a number of processor cores on a single chip to support parallel execution of computational tasks. Multicore CPUs can possibly improve performance over single cores for independent parallel tasks nearly linearly as long as sufficient bandwidth is available. Ideal speedup is, however, difficult to achieve when dense intercommunication between the cores or complex memory access patterns is required. This is caused by expensive synchronization and thread switching, and insufficient latency toleration. These facts guide programmers away from straight-forward parallel processing patterns toward complex and error-prone programming techniques. To address these problems, we have introduced the Thick control flow (TCF) Processor Architecture. TCF is an abstraction of parallel computation that combines self-similar threads into computational entities. In this paper, we compare the performance and programmability of an entry-level TCF processor and two Intel Skylake multicore CPUs on commonly used parallel kernels to find out how well our architecture solves these issues that greatly reduce the productivity of parallel software development. Code examples are given and programming experiences recorded.

Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.

Last updated on 2021-26-08 at 11:39