Accelerating gustavson-based SpMM on embedded FPGAs with element-wise parallelism and access pattern-aware caches

The Gustavson’s algorithm (i.e., the row-wise product algorithm) shows its potential as the backbone algorithm for sparse matrix-matrix multiplication (SpMM) on hardware accelerators. However, it still suffers from irregular memory accesses and thus its performance is bounded by the off-chip memory...

全面介紹

Saved in:
書目詳細資料
Main Authors: Li, Shiqing, Liu, Weichen
其他作者: School of Computer Science and Engineering
格式: Conference or Workshop Item
語言:English
出版: 2023
主題:
在線閱讀:https://hdl.handle.net/10356/167477
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!