Abstract : mapping streaming applications onto GPU systems

We describe an efficient and scalable code generation framework that automatically maps general purpose streaming applications onto GPU systems. This architecture-driven framework takes into account the idiosyncrasies of the GPU pipeline and the unique memory hierarchy. The framework has been implem...

Full description

Saved in:
Bibliographic Details
Main Authors: Huynh, Huynh Phung., Wong, Weng-Fai., Ray, A., Goh, Rick Siow Mong., Hagiescu, Andrei.
Other Authors: School of Computer Engineering
Format: Conference or Workshop Item
Language:English
Published: 2013
Subjects:
Online Access:https://hdl.handle.net/10356/84219
http://hdl.handle.net/10220/13114
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-84219
record_format dspace
spelling sg-ntu-dr.10356-842192020-05-28T07:17:15Z Abstract : mapping streaming applications onto GPU systems Huynh, Huynh Phung. Wong, Weng-Fai. Ray, A. Goh, Rick Siow Mong. Hagiescu, Andrei. School of Computer Engineering SC Companion: High Performance Computing, Networking, Storage and Analysis (2012 : Salt Lake City, Utah, United States) Parallel and Distributed Computing Centre DRNTU::Engineering::Computer science and engineering We describe an efficient and scalable code generation framework that automatically maps general purpose streaming applications onto GPU systems. This architecture-driven framework takes into account the idiosyncrasies of the GPU pipeline and the unique memory hierarchy. The framework has been implemented as a back-end to the StreamIt programming language compiler. Several key features in this framework ensure maximized performance and scalability. First, the generated code increases the effectiveness of the on-chip memory hierarchy by employing a heterogeneous mix of compute and memory access threads. Our scheme goes against the conventional wisdom of GPU programming which is to use a large number of homogeneous threads. Second, we utilise an efficient stream graph partitioning algorithm to handle larger applications and achieve the best performance under the given on-chip memory constraints. Lastly, the framework maps complex applications onto multiple GPUs using a highly effective pipeline execution scheme. Our comprehensive experiments show its scalability and significant speedup compared to a state-of-the-art solution. 2013-08-15T07:01:21Z 2019-12-06T15:40:47Z 2013-08-15T07:01:21Z 2019-12-06T15:40:47Z 2012 2012 Conference Paper https://hdl.handle.net/10356/84219 http://hdl.handle.net/10220/13114 10.1109/SC.Companion.2012.279 en
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering
spellingShingle DRNTU::Engineering::Computer science and engineering
Huynh, Huynh Phung.
Wong, Weng-Fai.
Ray, A.
Goh, Rick Siow Mong.
Hagiescu, Andrei.
Abstract : mapping streaming applications onto GPU systems
description We describe an efficient and scalable code generation framework that automatically maps general purpose streaming applications onto GPU systems. This architecture-driven framework takes into account the idiosyncrasies of the GPU pipeline and the unique memory hierarchy. The framework has been implemented as a back-end to the StreamIt programming language compiler. Several key features in this framework ensure maximized performance and scalability. First, the generated code increases the effectiveness of the on-chip memory hierarchy by employing a heterogeneous mix of compute and memory access threads. Our scheme goes against the conventional wisdom of GPU programming which is to use a large number of homogeneous threads. Second, we utilise an efficient stream graph partitioning algorithm to handle larger applications and achieve the best performance under the given on-chip memory constraints. Lastly, the framework maps complex applications onto multiple GPUs using a highly effective pipeline execution scheme. Our comprehensive experiments show its scalability and significant speedup compared to a state-of-the-art solution.
author2 School of Computer Engineering
author_facet School of Computer Engineering
Huynh, Huynh Phung.
Wong, Weng-Fai.
Ray, A.
Goh, Rick Siow Mong.
Hagiescu, Andrei.
format Conference or Workshop Item
author Huynh, Huynh Phung.
Wong, Weng-Fai.
Ray, A.
Goh, Rick Siow Mong.
Hagiescu, Andrei.
author_sort Huynh, Huynh Phung.
title Abstract : mapping streaming applications onto GPU systems
title_short Abstract : mapping streaming applications onto GPU systems
title_full Abstract : mapping streaming applications onto GPU systems
title_fullStr Abstract : mapping streaming applications onto GPU systems
title_full_unstemmed Abstract : mapping streaming applications onto GPU systems
title_sort abstract : mapping streaming applications onto gpu systems
publishDate 2013
url https://hdl.handle.net/10356/84219
http://hdl.handle.net/10220/13114
_version_ 1681059138822471680