Abstract : mapping streaming applications onto GPU systems
We describe an efficient and scalable code generation framework that automatically maps general purpose streaming applications onto GPU systems. This architecture-driven framework takes into account the idiosyncrasies of the GPU pipeline and the unique memory hierarchy. The framework has been implem...
Saved in:
Main Authors: | , , , , |
---|---|
Other Authors: | |
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2013
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/84219 http://hdl.handle.net/10220/13114 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-84219 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-842192020-05-28T07:17:15Z Abstract : mapping streaming applications onto GPU systems Huynh, Huynh Phung. Wong, Weng-Fai. Ray, A. Goh, Rick Siow Mong. Hagiescu, Andrei. School of Computer Engineering SC Companion: High Performance Computing, Networking, Storage and Analysis (2012 : Salt Lake City, Utah, United States) Parallel and Distributed Computing Centre DRNTU::Engineering::Computer science and engineering We describe an efficient and scalable code generation framework that automatically maps general purpose streaming applications onto GPU systems. This architecture-driven framework takes into account the idiosyncrasies of the GPU pipeline and the unique memory hierarchy. The framework has been implemented as a back-end to the StreamIt programming language compiler. Several key features in this framework ensure maximized performance and scalability. First, the generated code increases the effectiveness of the on-chip memory hierarchy by employing a heterogeneous mix of compute and memory access threads. Our scheme goes against the conventional wisdom of GPU programming which is to use a large number of homogeneous threads. Second, we utilise an efficient stream graph partitioning algorithm to handle larger applications and achieve the best performance under the given on-chip memory constraints. Lastly, the framework maps complex applications onto multiple GPUs using a highly effective pipeline execution scheme. Our comprehensive experiments show its scalability and significant speedup compared to a state-of-the-art solution. 2013-08-15T07:01:21Z 2019-12-06T15:40:47Z 2013-08-15T07:01:21Z 2019-12-06T15:40:47Z 2012 2012 Conference Paper https://hdl.handle.net/10356/84219 http://hdl.handle.net/10220/13114 10.1109/SC.Companion.2012.279 en |
institution |
Nanyang Technological University |
building |
NTU Library |
country |
Singapore |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Computer science and engineering |
spellingShingle |
DRNTU::Engineering::Computer science and engineering Huynh, Huynh Phung. Wong, Weng-Fai. Ray, A. Goh, Rick Siow Mong. Hagiescu, Andrei. Abstract : mapping streaming applications onto GPU systems |
description |
We describe an efficient and scalable code generation framework that automatically maps general purpose streaming applications onto GPU systems. This architecture-driven framework takes into account the idiosyncrasies of the GPU pipeline and the unique memory hierarchy. The framework has been implemented as a back-end to the StreamIt programming language compiler. Several key features in this framework ensure maximized performance and scalability. First, the generated code increases the effectiveness of the on-chip memory hierarchy by employing a heterogeneous mix of compute and memory access threads. Our scheme goes against the conventional wisdom of GPU programming which is to use a large number of homogeneous threads. Second, we utilise an efficient stream graph partitioning algorithm to handle larger applications and achieve the best performance under the given on-chip memory constraints. Lastly, the framework maps complex applications onto multiple GPUs using a highly effective pipeline execution scheme. Our comprehensive experiments show its scalability and significant speedup compared to a state-of-the-art solution. |
author2 |
School of Computer Engineering |
author_facet |
School of Computer Engineering Huynh, Huynh Phung. Wong, Weng-Fai. Ray, A. Goh, Rick Siow Mong. Hagiescu, Andrei. |
format |
Conference or Workshop Item |
author |
Huynh, Huynh Phung. Wong, Weng-Fai. Ray, A. Goh, Rick Siow Mong. Hagiescu, Andrei. |
author_sort |
Huynh, Huynh Phung. |
title |
Abstract : mapping streaming applications onto GPU systems |
title_short |
Abstract : mapping streaming applications onto GPU systems |
title_full |
Abstract : mapping streaming applications onto GPU systems |
title_fullStr |
Abstract : mapping streaming applications onto GPU systems |
title_full_unstemmed |
Abstract : mapping streaming applications onto GPU systems |
title_sort |
abstract : mapping streaming applications onto gpu systems |
publishDate |
2013 |
url |
https://hdl.handle.net/10356/84219 http://hdl.handle.net/10220/13114 |
_version_ |
1681059138822471680 |