Efficient polynomial evaluation algorithm and implementation on FPGA

In this thesis, an optimized polynomial evaluation algorithm is presented. Compared to Horner's Rule which has the least number of computation steps but longest latency, or parallel evaluation methods like Estrin's method which are fast but with large hardware overhead, the proposed algori...

Full description

Saved in:

Bibliographic Details
Main Author:	Xu, Simin
Other Authors:	Ian Vince McLoughlin
Format:	Theses and Dissertations
Language:	English
Published:	2013
Subjects:	DRNTU::Engineering::Computer science and engineering::Hardware::Arithmetic and logic structures
Online Access:	https://hdl.handle.net/10356/54869
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-54869
record_format	dspace
spelling	sg-ntu-dr.10356-548692023-03-04T00:37:27Z Efficient polynomial evaluation algorithm and implementation on FPGA Xu, Simin Ian Vince McLoughlin School of Computer Engineering Suhaib Fahmy DRNTU::Engineering::Computer science and engineering::Hardware::Arithmetic and logic structures In this thesis, an optimized polynomial evaluation algorithm is presented. Compared to Horner's Rule which has the least number of computation steps but longest latency, or parallel evaluation methods like Estrin's method which are fast but with large hardware overhead, the proposed algorithm could achieve high level of parallelism with smallest area, by means of replacing multiplication with sqaure. To enable the performance gain for the proposed algorithm, an efficient integer squarer is proposed and implemented in FPGA with fewer DSP blocks. Previous work has presented tiling method for a double precision squarer which uses the least amount of DSP blocks so far. However it incurs a large LUT overhead and has a complex and irregular structure that it is not expandable for higher word size. The circuit proposed in this thesis can reduce the DSP block usage by an equivalent amount compared to the tiling method while incurring a much lower LUT overhead: 21.8\% fewer LUTs for a 53-bit squarer. The circuit is mapped to Xilinx Virtex 6 FPGA and evaluated for a wide range of operand word sizes, demonstrating its scalability and efficiency. With the novel squarer, the proposed polynomial algorithm exhibits 41\% latency reduction over conventional Horner's Rule for a $5^{th}$ degree polynomial with 11.9\% less area and 44.8\% latency reduction in a $4^{th}$ degree polynomial with 5\% less area on FPGA. In contrast, Estrin's method occupies 26\% and 16.5\% more area compared to Horner's Rule to achieve same level of speed improvement for the same $5^{th}$ and $4^{th}$ degree polynomial respectively. MASTER OF ENGINEERING (SCE) 2013-09-30T09:07:14Z 2013-09-30T09:07:14Z 2013 2013 Thesis Xu, S. (2013). Efficient polynomial evaluation algorithm and implementation on FPGA. Master’s thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/54869 10.32657/10356/54869 en 98 p. application/pdf
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	DRNTU::Engineering::Computer science and engineering::Hardware::Arithmetic and logic structures
spellingShingle	DRNTU::Engineering::Computer science and engineering::Hardware::Arithmetic and logic structures Xu, Simin Efficient polynomial evaluation algorithm and implementation on FPGA
description	In this thesis, an optimized polynomial evaluation algorithm is presented. Compared to Horner's Rule which has the least number of computation steps but longest latency, or parallel evaluation methods like Estrin's method which are fast but with large hardware overhead, the proposed algorithm could achieve high level of parallelism with smallest area, by means of replacing multiplication with sqaure. To enable the performance gain for the proposed algorithm, an efficient integer squarer is proposed and implemented in FPGA with fewer DSP blocks. Previous work has presented tiling method for a double precision squarer which uses the least amount of DSP blocks so far. However it incurs a large LUT overhead and has a complex and irregular structure that it is not expandable for higher word size. The circuit proposed in this thesis can reduce the DSP block usage by an equivalent amount compared to the tiling method while incurring a much lower LUT overhead: 21.8\% fewer LUTs for a 53-bit squarer. The circuit is mapped to Xilinx Virtex 6 FPGA and evaluated for a wide range of operand word sizes, demonstrating its scalability and efficiency. With the novel squarer, the proposed polynomial algorithm exhibits 41\% latency reduction over conventional Horner's Rule for a $5^{th}$ degree polynomial with 11.9\% less area and 44.8\% latency reduction in a $4^{th}$ degree polynomial with 5\% less area on FPGA. In contrast, Estrin's method occupies 26\% and 16.5\% more area compared to Horner's Rule to achieve same level of speed improvement for the same $5^{th}$ and $4^{th}$ degree polynomial respectively.
author2	Ian Vince McLoughlin
author_facet	Ian Vince McLoughlin Xu, Simin
format	Theses and Dissertations
author	Xu, Simin
author_sort	Xu, Simin
title	Efficient polynomial evaluation algorithm and implementation on FPGA
title_short	Efficient polynomial evaluation algorithm and implementation on FPGA
title_full	Efficient polynomial evaluation algorithm and implementation on FPGA
title_fullStr	Efficient polynomial evaluation algorithm and implementation on FPGA
title_full_unstemmed	Efficient polynomial evaluation algorithm and implementation on FPGA
title_sort	efficient polynomial evaluation algorithm and implementation on fpga
publishDate	2013
url	https://hdl.handle.net/10356/54869
_version_	1759857361668800512

Efficient polynomial evaluation algorithm and implementation on FPGA

Similar Items