Efficient polynomial evaluation algorithm and implementation on FPGA

In this thesis, an optimized polynomial evaluation algorithm is presented. Compared to Horner's Rule which has the least number of computation steps but longest latency, or parallel evaluation methods like Estrin's method which are fast but with large hardware overhead, the proposed algori...

Full description

Saved in:
Bibliographic Details
Main Author: Xu, Simin
Other Authors: Ian Vince McLoughlin
Format: Theses and Dissertations
Language:English
Published: 2013
Subjects:
Online Access:https://hdl.handle.net/10356/54869
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-54869
record_format dspace
spelling sg-ntu-dr.10356-548692023-03-04T00:37:27Z Efficient polynomial evaluation algorithm and implementation on FPGA Xu, Simin Ian Vince McLoughlin School of Computer Engineering Suhaib Fahmy DRNTU::Engineering::Computer science and engineering::Hardware::Arithmetic and logic structures In this thesis, an optimized polynomial evaluation algorithm is presented. Compared to Horner's Rule which has the least number of computation steps but longest latency, or parallel evaluation methods like Estrin's method which are fast but with large hardware overhead, the proposed algorithm could achieve high level of parallelism with smallest area, by means of replacing multiplication with sqaure. To enable the performance gain for the proposed algorithm, an efficient integer squarer is proposed and implemented in FPGA with fewer DSP blocks. Previous work has presented tiling method for a double precision squarer which uses the least amount of DSP blocks so far. However it incurs a large LUT overhead and has a complex and irregular structure that it is not expandable for higher word size. The circuit proposed in this thesis can reduce the DSP block usage by an equivalent amount compared to the tiling method while incurring a much lower LUT overhead: 21.8\% fewer LUTs for a 53-bit squarer. The circuit is mapped to Xilinx Virtex 6 FPGA and evaluated for a wide range of operand word sizes, demonstrating its scalability and efficiency. With the novel squarer, the proposed polynomial algorithm exhibits 41\% latency reduction over conventional Horner's Rule for a $5^{th}$ degree polynomial with 11.9\% less area and 44.8\% latency reduction in a $4^{th}$ degree polynomial with 5\% less area on FPGA. In contrast, Estrin's method occupies 26\% and 16.5\% more area compared to Horner's Rule to achieve same level of speed improvement for the same $5^{th}$ and $4^{th}$ degree polynomial respectively. MASTER OF ENGINEERING (SCE) 2013-09-30T09:07:14Z 2013-09-30T09:07:14Z 2013 2013 Thesis Xu, S. (2013). Efficient polynomial evaluation algorithm and implementation on FPGA. Master’s thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/54869 10.32657/10356/54869 en 98 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering::Hardware::Arithmetic and logic structures
spellingShingle DRNTU::Engineering::Computer science and engineering::Hardware::Arithmetic and logic structures
Xu, Simin
Efficient polynomial evaluation algorithm and implementation on FPGA
description In this thesis, an optimized polynomial evaluation algorithm is presented. Compared to Horner's Rule which has the least number of computation steps but longest latency, or parallel evaluation methods like Estrin's method which are fast but with large hardware overhead, the proposed algorithm could achieve high level of parallelism with smallest area, by means of replacing multiplication with sqaure. To enable the performance gain for the proposed algorithm, an efficient integer squarer is proposed and implemented in FPGA with fewer DSP blocks. Previous work has presented tiling method for a double precision squarer which uses the least amount of DSP blocks so far. However it incurs a large LUT overhead and has a complex and irregular structure that it is not expandable for higher word size. The circuit proposed in this thesis can reduce the DSP block usage by an equivalent amount compared to the tiling method while incurring a much lower LUT overhead: 21.8\% fewer LUTs for a 53-bit squarer. The circuit is mapped to Xilinx Virtex 6 FPGA and evaluated for a wide range of operand word sizes, demonstrating its scalability and efficiency. With the novel squarer, the proposed polynomial algorithm exhibits 41\% latency reduction over conventional Horner's Rule for a $5^{th}$ degree polynomial with 11.9\% less area and 44.8\% latency reduction in a $4^{th}$ degree polynomial with 5\% less area on FPGA. In contrast, Estrin's method occupies 26\% and 16.5\% more area compared to Horner's Rule to achieve same level of speed improvement for the same $5^{th}$ and $4^{th}$ degree polynomial respectively.
author2 Ian Vince McLoughlin
author_facet Ian Vince McLoughlin
Xu, Simin
format Theses and Dissertations
author Xu, Simin
author_sort Xu, Simin
title Efficient polynomial evaluation algorithm and implementation on FPGA
title_short Efficient polynomial evaluation algorithm and implementation on FPGA
title_full Efficient polynomial evaluation algorithm and implementation on FPGA
title_fullStr Efficient polynomial evaluation algorithm and implementation on FPGA
title_full_unstemmed Efficient polynomial evaluation algorithm and implementation on FPGA
title_sort efficient polynomial evaluation algorithm and implementation on fpga
publishDate 2013
url https://hdl.handle.net/10356/54869
_version_ 1759857361668800512