Efficient polynomial evaluation algorithm and implementation on FPGA
In this thesis, an optimized polynomial evaluation algorithm is presented. Compared to Horner's Rule which has the least number of computation steps but longest latency, or parallel evaluation methods like Estrin's method which are fast but with large hardware overhead, the proposed algori...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Theses and Dissertations |
Language: | English |
Published: |
2013
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/54869 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-54869 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-548692023-03-04T00:37:27Z Efficient polynomial evaluation algorithm and implementation on FPGA Xu, Simin Ian Vince McLoughlin School of Computer Engineering Suhaib Fahmy DRNTU::Engineering::Computer science and engineering::Hardware::Arithmetic and logic structures In this thesis, an optimized polynomial evaluation algorithm is presented. Compared to Horner's Rule which has the least number of computation steps but longest latency, or parallel evaluation methods like Estrin's method which are fast but with large hardware overhead, the proposed algorithm could achieve high level of parallelism with smallest area, by means of replacing multiplication with sqaure. To enable the performance gain for the proposed algorithm, an efficient integer squarer is proposed and implemented in FPGA with fewer DSP blocks. Previous work has presented tiling method for a double precision squarer which uses the least amount of DSP blocks so far. However it incurs a large LUT overhead and has a complex and irregular structure that it is not expandable for higher word size. The circuit proposed in this thesis can reduce the DSP block usage by an equivalent amount compared to the tiling method while incurring a much lower LUT overhead: 21.8\% fewer LUTs for a 53-bit squarer. The circuit is mapped to Xilinx Virtex 6 FPGA and evaluated for a wide range of operand word sizes, demonstrating its scalability and efficiency. With the novel squarer, the proposed polynomial algorithm exhibits 41\% latency reduction over conventional Horner's Rule for a $5^{th}$ degree polynomial with 11.9\% less area and 44.8\% latency reduction in a $4^{th}$ degree polynomial with 5\% less area on FPGA. In contrast, Estrin's method occupies 26\% and 16.5\% more area compared to Horner's Rule to achieve same level of speed improvement for the same $5^{th}$ and $4^{th}$ degree polynomial respectively. MASTER OF ENGINEERING (SCE) 2013-09-30T09:07:14Z 2013-09-30T09:07:14Z 2013 2013 Thesis Xu, S. (2013). Efficient polynomial evaluation algorithm and implementation on FPGA. Master’s thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/54869 10.32657/10356/54869 en 98 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Computer science and engineering::Hardware::Arithmetic and logic structures |
spellingShingle |
DRNTU::Engineering::Computer science and engineering::Hardware::Arithmetic and logic structures Xu, Simin Efficient polynomial evaluation algorithm and implementation on FPGA |
description |
In this thesis, an optimized polynomial evaluation algorithm is presented. Compared to Horner's Rule which has the least number of computation steps but longest latency, or parallel evaluation methods like Estrin's method which are fast but with large hardware overhead, the proposed algorithm could achieve high level of parallelism with smallest area, by means of replacing multiplication with sqaure. To enable the performance gain for the proposed algorithm, an efficient integer squarer is proposed and implemented in FPGA with fewer DSP blocks. Previous work has presented tiling method for a double precision squarer which uses the least amount of DSP blocks so far. However it incurs a large LUT overhead and has a complex and irregular structure that it is not expandable for higher word size. The circuit proposed in this thesis can reduce the DSP block usage by an equivalent amount compared to the tiling method while incurring a much lower LUT overhead: 21.8\% fewer LUTs for a 53-bit squarer. The circuit is mapped to Xilinx Virtex 6 FPGA and evaluated for a wide range of operand word sizes, demonstrating its scalability and efficiency. With the novel squarer, the proposed polynomial algorithm exhibits 41\% latency reduction over conventional Horner's Rule for a $5^{th}$ degree polynomial with 11.9\% less area and 44.8\% latency reduction in a $4^{th}$ degree polynomial with 5\% less area on FPGA. In contrast, Estrin's method occupies 26\% and 16.5\% more area compared to Horner's Rule to achieve same level of speed improvement for the same $5^{th}$ and $4^{th}$ degree polynomial respectively. |
author2 |
Ian Vince McLoughlin |
author_facet |
Ian Vince McLoughlin Xu, Simin |
format |
Theses and Dissertations |
author |
Xu, Simin |
author_sort |
Xu, Simin |
title |
Efficient polynomial evaluation algorithm and implementation on FPGA |
title_short |
Efficient polynomial evaluation algorithm and implementation on FPGA |
title_full |
Efficient polynomial evaluation algorithm and implementation on FPGA |
title_fullStr |
Efficient polynomial evaluation algorithm and implementation on FPGA |
title_full_unstemmed |
Efficient polynomial evaluation algorithm and implementation on FPGA |
title_sort |
efficient polynomial evaluation algorithm and implementation on fpga |
publishDate |
2013 |
url |
https://hdl.handle.net/10356/54869 |
_version_ |
1759857361668800512 |