A4 Refereed article in a conference publication

A novel multi-level integrated roofline model approach for performance characterization




AuthorsTuomas Koskela, Zakhar Matveev, Charlene Yang, Adetokunbo Adedoyin, Roman Belenov, Philippe Thierry, Zhengji Zhao, Rahulkumar Gayatri, Hongzhang Shan, Leonid Oliker, Jack Deslippe, Ron Green, Samuel Williams

EditorsRio Yokota, Michèle Weiland, David Keyes, Carsten Trinitis

Conference nameInternational Conference on High Performance Computing

PublisherSpringer Verlag

Publication year2018

JournalLecture Notes in Computer Science

Book title High Performance Computing

Journal name in sourceLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Series titleLecture Notes in Computer Science

Volume10876

First page 226

Last page245

Number of pages20

ISBN978-3-319-92039-9

eISBN978-3-319-92040-5

ISSN0302-9743

DOIhttps://doi.org/10.1007/978-3-319-92040-5_12

Self-archived copy’s web addresshttps://research.utu.fi/converis/portal/detail/Publication/32081756


Abstract
With energy-efficient architectures, including accelerators and many-core processors, gaining traction, application developers face the challenge of optimizing their applications for multiple hardware features including many-core parallelism, wide processing vector-units and on-chip high-bandwidth memory. In this paper, we discuss the development and utilization of a new application performance tool based on an extension of the classical roofline-model for simultaneously profiling multiple levels in the cache-memory hierarchy. This tool presents a powerful visual aid for the developer and can be used to frame the many-dimensional optimization problem in a tractable way. We show case studies of real scientific applications that have gained insights from the Integrated Roofline Model.

Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.





Last updated on 2024-26-11 at 18:25