Enhancing performance of Tall-Skinny QR factorization using FPGAs
Communication-avoiding linear algebra algorithms with low communication latency and high memory bandwidth requirements like Tall-Skinny QR factorization (TSQR) are highly appropriate for acceleration using FPGAs. TSQR parallelizes QR factorization of tall-skinny matrices in a divide-and-conquer fash...
Saved in:
Main Authors: | Rafique, Abid, Kapre, Nachiket, Constantinides, George A. |
---|---|
其他作者: | School of Computer Engineering |
格式: | Conference or Workshop Item |
語言: | English |
出版: |
2015
|
主題: | |
在線閱讀: | https://hdl.handle.net/10356/81242 http://hdl.handle.net/10220/39153 |
標簽: |
添加標簽
沒有標簽, 成為第一個標記此記錄!
|
機構: | Nanyang Technological University |
語言: | English |
相似書籍
-
Application composition and communication optimization in iterative solvers using FPGAs
由: Rafique, Abid, et al.
出版: (2013) -
Communication Optimization of Iterative Sparse Matrix-Vector Multiply on GPUs and FPGAs
由: Rafique, Abid, et al.
出版: (2015) -
Accelerating SPICE Model-Evaluation using FPGAs
由: Kapre, Nachiket, et al.
出版: (2015) -
Hoplite: Building austere overlay NoCs for FPGAs
由: Kapre, Nachiket, et al.
出版: (2015) -
Parallelizing Sparse Matrix Solve for SPICE Circuit Simulation using FPGAs
由: Kapre, Nachiket, et al.
出版: (2015)