Enhancing performance of Tall-Skinny QR factorization using FPGAs
Communication-avoiding linear algebra algorithms with low communication latency and high memory bandwidth requirements like Tall-Skinny QR factorization (TSQR) are highly appropriate for acceleration using FPGAs. TSQR parallelizes QR factorization of tall-skinny matrices in a divide-and-conquer fash...
Main Authors: | Rafique, Abid, Kapre, Nachiket, Constantinides, George A. |
---|---|
Other Authors: | School of Computer Engineering |
Format: | Conference or Workshop Item |
Language: | English |
Published: | 2015 |
Online Access: | https://hdl.handle.net/10356/81242 http://hdl.handle.net/10220/39153 |
Institution: | Nanyang Technological University |
Similar Items
- Application composition and communication optimization in iterative solvers using FPGAs
  by: Rafique, Abid, et al.
  Published: (2013)
- Communication Optimization of Iterative Sparse Matrix-Vector Multiply on GPUs and FPGAs
  by: Rafique, Abid, et al.
  Published: (2015)
- Accelerating SPICE Model-Evaluation using FPGAs
  by: Kapre, Nachiket, et al.
  Published: (2015)
- Hoplite: Building austere overlay NoCs for FPGAs
  by: Kapre, Nachiket, et al.
  Published: (2015)
- Parallelizing Sparse Matrix Solve for SPICE Circuit Simulation using FPGAs
  by: Kapre, Nachiket, et al.
  Published: (2015)