Digital audio/speech forensics

MPEG-1 Audio Layer 3 (MP3) is one of or if not, the most common audio media format these days and with advancement in technology, portable devices like MP3 players or digital voice recorders can easily record MP3 while on the go. Sometimes, these MP3 recordings might be important evidences that need...

Full description

Saved in:
Bibliographic Details
Main Author: Tok, De Fang.
Other Authors: Sabu Emmanuel
Format: Final Year Project
Language:English
Published: 2010
Subjects:
Online Access:http://hdl.handle.net/10356/38531
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-38531
record_format dspace
spelling sg-ntu-dr.10356-385312023-03-03T20:32:32Z Digital audio/speech forensics Tok, De Fang. Sabu Emmanuel School of Computer Engineering Centre for Multimedia and Network Technology DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition MPEG-1 Audio Layer 3 (MP3) is one of or if not, the most common audio media format these days and with advancement in technology, portable devices like MP3 players or digital voice recorders can easily record MP3 while on the go. Sometimes, these MP3 recordings might be important evidences that need to be presented in a court hearing. The challenge therefore is how to ascertain the audio file authenticity especially when MP3 files can be doctored easily with pervasive audio editing software. The purpose of this project is to implement certain techniques that can be used to detect forgeries, namely – deletion, insertion and substitution, in audio files. The project took reference from the paper by Rui Yang on detecting of forgeries in MP3 files via frame offsets. His algorithm based on LAME MP3 codec was tested on files doctored with the different forms of forgeries. After the testing, a smoothing technique based on the median filter concept was implemented to achieve better results of reducing the unwanted spikes and troughs present due to padding of zeroes by the codec, noise or silent frames. Next a novel method to determine the frame offset corresponding to the initial MP3 encoding of the file was introduced. Even though the technique was unable to detect any points of forgery, it will be able to detect different offsets from a file that has been encoded multiple times highlighting the fact it has been doctored before as non doctored files will only give rise to one single offset even after multiple encodings. As the method is brief, it might be useful to look further into detection of forgery in audios files that has been encoded multiple times. Bachelor of Engineering (Computer Engineering) 2010-05-10T07:34:09Z 2010-05-10T07:34:09Z 2010 2010 Final Year Project (FYP) http://hdl.handle.net/10356/38531 en Nanyang Technological University 75 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
spellingShingle DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
Tok, De Fang.
Digital audio/speech forensics
description MPEG-1 Audio Layer 3 (MP3) is one of or if not, the most common audio media format these days and with advancement in technology, portable devices like MP3 players or digital voice recorders can easily record MP3 while on the go. Sometimes, these MP3 recordings might be important evidences that need to be presented in a court hearing. The challenge therefore is how to ascertain the audio file authenticity especially when MP3 files can be doctored easily with pervasive audio editing software. The purpose of this project is to implement certain techniques that can be used to detect forgeries, namely – deletion, insertion and substitution, in audio files. The project took reference from the paper by Rui Yang on detecting of forgeries in MP3 files via frame offsets. His algorithm based on LAME MP3 codec was tested on files doctored with the different forms of forgeries. After the testing, a smoothing technique based on the median filter concept was implemented to achieve better results of reducing the unwanted spikes and troughs present due to padding of zeroes by the codec, noise or silent frames. Next a novel method to determine the frame offset corresponding to the initial MP3 encoding of the file was introduced. Even though the technique was unable to detect any points of forgery, it will be able to detect different offsets from a file that has been encoded multiple times highlighting the fact it has been doctored before as non doctored files will only give rise to one single offset even after multiple encodings. As the method is brief, it might be useful to look further into detection of forgery in audios files that has been encoded multiple times.
author2 Sabu Emmanuel
author_facet Sabu Emmanuel
Tok, De Fang.
format Final Year Project
author Tok, De Fang.
author_sort Tok, De Fang.
title Digital audio/speech forensics
title_short Digital audio/speech forensics
title_full Digital audio/speech forensics
title_fullStr Digital audio/speech forensics
title_full_unstemmed Digital audio/speech forensics
title_sort digital audio/speech forensics
publishDate 2010
url http://hdl.handle.net/10356/38531
_version_ 1759855395340288000