Communication Optimization of Iterative Sparse Matrix-Vector Multiply on GPUs and FPGAs

Trading communication with redundant computation can increase the silicon efficiency of FPGAs and GPUs in accelerating communication-bound sparse iterative solvers. While k iterations of the iterative solver can be unrolled to provide O(k) reduction in communication cost, the extent of this unrollin...

全面介紹

Saved in:
書目詳細資料
Main Authors: Rafique, Abid, Constantinides, George A., Kapre, Nachiket
其他作者: School of Computer Engineering
格式: Article
語言:English
出版: 2015
主題:
在線閱讀:https://hdl.handle.net/10356/81168
http://hdl.handle.net/10220/39128
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
機構: Nanyang Technological University
語言: English