FPGA implementation of a speech coder

In this project, one of the speech compression methods, a LPC 10 decoder was designed by using VHDL. In the designed LPC decoder, two different types of speech production model were designed and used to compare their performances. The first model used two adders, two multipliers and one substractor...

Full description

Saved in:
Bibliographic Details
Main Author: Tay, Zar Myint.
Other Authors: Ho Duan Juat
Format: Final Year Project
Language:English
Published: 2010
Subjects:
Online Access:http://hdl.handle.net/10356/40567
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-40567
record_format dspace
spelling sg-ntu-dr.10356-405672023-07-07T16:12:59Z FPGA implementation of a speech coder Tay, Zar Myint. Ho Duan Juat School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing In this project, one of the speech compression methods, a LPC 10 decoder was designed by using VHDL. In the designed LPC decoder, two different types of speech production model were designed and used to compare their performances. The first model used two adders, two multipliers and one substractor and the second one used four multipliers, four adders and one substractor. It was found that the first model took three cycles more than the second one to complete the one sample. On the other hand, the first model only used total 19778 gates while the second one used 28901 gates. Therefore the second model took more resources. After completing the speech production model, real speech parameters are used to feed into the speech production model and simulated to generate synthesis speech. The reconstructed speech signal was able to achieve the original signal at reasonable quality. In order to analyze the performance of the designed speech production model, another speech production model was created by using Microsoft Excel, which used exact fraction value of LPC parameters and the output speech signal was used as a model answer of a LPC speech production. The LPC parameters such as coefficients, pitch, vice information and gain were estimated by using MatLab. After feeding parameters into speech production model, the result speech signals are put in Matlab to generate the waveform and carried out the analyzing and comparison. Comparing to original speech, the pitches of simulated signal and amplitudes were different from original due to constant value of estimated parameter value in 20ms frame. The error was analyzed and found that the error occurred due to binary representation. Bachelor of Engineering 2010-06-16T07:54:04Z 2010-06-16T07:54:04Z 2010 2010 Final Year Project (FYP) http://hdl.handle.net/10356/40567 en Nanyang Technological University 96 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
spellingShingle DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
Tay, Zar Myint.
FPGA implementation of a speech coder
description In this project, one of the speech compression methods, a LPC 10 decoder was designed by using VHDL. In the designed LPC decoder, two different types of speech production model were designed and used to compare their performances. The first model used two adders, two multipliers and one substractor and the second one used four multipliers, four adders and one substractor. It was found that the first model took three cycles more than the second one to complete the one sample. On the other hand, the first model only used total 19778 gates while the second one used 28901 gates. Therefore the second model took more resources. After completing the speech production model, real speech parameters are used to feed into the speech production model and simulated to generate synthesis speech. The reconstructed speech signal was able to achieve the original signal at reasonable quality. In order to analyze the performance of the designed speech production model, another speech production model was created by using Microsoft Excel, which used exact fraction value of LPC parameters and the output speech signal was used as a model answer of a LPC speech production. The LPC parameters such as coefficients, pitch, vice information and gain were estimated by using MatLab. After feeding parameters into speech production model, the result speech signals are put in Matlab to generate the waveform and carried out the analyzing and comparison. Comparing to original speech, the pitches of simulated signal and amplitudes were different from original due to constant value of estimated parameter value in 20ms frame. The error was analyzed and found that the error occurred due to binary representation.
author2 Ho Duan Juat
author_facet Ho Duan Juat
Tay, Zar Myint.
format Final Year Project
author Tay, Zar Myint.
author_sort Tay, Zar Myint.
title FPGA implementation of a speech coder
title_short FPGA implementation of a speech coder
title_full FPGA implementation of a speech coder
title_fullStr FPGA implementation of a speech coder
title_full_unstemmed FPGA implementation of a speech coder
title_sort fpga implementation of a speech coder
publishDate 2010
url http://hdl.handle.net/10356/40567
_version_ 1772828085594357760