Fanout decomposition dataflow optimizations for FPGA-based Sparse LU factorization

Performance of FPGA-based token dataflow architectures is often limited by the long tail distribution of parallelism in the compute paths of the dataflow graphs. This is known to limit speedup of dataflow processing of Sparse LU factorization to only 3-10x over CPUs. One reason behind the limitation...

全面介紹

Saved in:
書目詳細資料
Main Authors: Siddhartha, Kapre, Nachiket
其他作者: School of Computer Engineering
格式: Conference or Workshop Item
語言:English
出版: 2015
主題:
在線閱讀:https://hdl.handle.net/10356/81207
http://hdl.handle.net/10220/39179
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
機構: Nanyang Technological University
語言: English