ASIC implementation of counter-based 1-D DCT processor

This project proposes a new design of 8x8 1-D Discrete Cosine Transform (DCT) unit. It avoids using multipliers for inner products by employing the counter-based inner product architecture. As the inner products for DCT are constant-variable multiplications, redundancies are exploited to optimize th...

Full description

Saved in:
Bibliographic Details
Main Author: Zhang, Li
Other Authors: Chang Chip Hong
Format: Final Year Project
Language:English
Published: 2010
Subjects:
Online Access:http://hdl.handle.net/10356/40744
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-40744
record_format dspace
spelling sg-ntu-dr.10356-407442023-07-07T15:49:24Z ASIC implementation of counter-based 1-D DCT processor Zhang, Li Chang Chip Hong School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Electronic circuits This project proposes a new design of 8x8 1-D Discrete Cosine Transform (DCT) unit. It avoids using multipliers for inner products by employing the counter-based inner product architecture. As the inner products for DCT are constant-variable multiplications, redundancies are exploited to optimize the architecture further. This report discusses the entire flow for the ASIC implementation process, from HDL coding and functional simulations to synthesis and place & routing of the design. Timing analysis is also done for design verification. The counter-based inner product architecture is the major part of the design, which can be seen as serial architecture for multiplication. It uses counters to store values of partial product matrix from inner products, which transforms L vertical bits into ⌊log2L⌋+1 horizontal bits of the accumulated matrix and thus reduces the matrix height drastically. This results in much fewer adders used in the reduction stage and less hardware required. On the other hand, as the counters can operate at a frequency of a few GHz, the accumulation process can be performed quite fast and the throughput of this design is comparable with many existing parallel inner-product computation architectures. The 1-D DCT is designed to contain 8 sets of such architectures, each of which is used to deal with one inner product required in the forward DCT computation. After placement & routing process, the reported minimum clock period for the design is 0.5 ns. The design has an initial latency of 18 clock cycles in pipelining, and the pipelined technique used inside the counter-based architecture enables our proposed 1-D DCT architecture to output the eight 1-D DCT transformed domain values every 9 clock cycles (4.5 ns). Bachelor of Engineering 2010-06-21T06:10:23Z 2010-06-21T06:10:23Z 2010 2010 Final Year Project (FYP) http://hdl.handle.net/10356/40744 en Nanyang Technological University 74 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Electrical and electronic engineering::Electronic circuits
spellingShingle DRNTU::Engineering::Electrical and electronic engineering::Electronic circuits
Zhang, Li
ASIC implementation of counter-based 1-D DCT processor
description This project proposes a new design of 8x8 1-D Discrete Cosine Transform (DCT) unit. It avoids using multipliers for inner products by employing the counter-based inner product architecture. As the inner products for DCT are constant-variable multiplications, redundancies are exploited to optimize the architecture further. This report discusses the entire flow for the ASIC implementation process, from HDL coding and functional simulations to synthesis and place & routing of the design. Timing analysis is also done for design verification. The counter-based inner product architecture is the major part of the design, which can be seen as serial architecture for multiplication. It uses counters to store values of partial product matrix from inner products, which transforms L vertical bits into ⌊log2L⌋+1 horizontal bits of the accumulated matrix and thus reduces the matrix height drastically. This results in much fewer adders used in the reduction stage and less hardware required. On the other hand, as the counters can operate at a frequency of a few GHz, the accumulation process can be performed quite fast and the throughput of this design is comparable with many existing parallel inner-product computation architectures. The 1-D DCT is designed to contain 8 sets of such architectures, each of which is used to deal with one inner product required in the forward DCT computation. After placement & routing process, the reported minimum clock period for the design is 0.5 ns. The design has an initial latency of 18 clock cycles in pipelining, and the pipelined technique used inside the counter-based architecture enables our proposed 1-D DCT architecture to output the eight 1-D DCT transformed domain values every 9 clock cycles (4.5 ns).
author2 Chang Chip Hong
author_facet Chang Chip Hong
Zhang, Li
format Final Year Project
author Zhang, Li
author_sort Zhang, Li
title ASIC implementation of counter-based 1-D DCT processor
title_short ASIC implementation of counter-based 1-D DCT processor
title_full ASIC implementation of counter-based 1-D DCT processor
title_fullStr ASIC implementation of counter-based 1-D DCT processor
title_full_unstemmed ASIC implementation of counter-based 1-D DCT processor
title_sort asic implementation of counter-based 1-d dct processor
publishDate 2010
url http://hdl.handle.net/10356/40744
_version_ 1772825402160447488