Acoustic and video processing demonstration system

An acoustic and video processing demonstration system was developed. Direction of Arrival (DOA) estimation or sound source locations were overlaid on a video stream of the application, in real time. The application will implement the Steered Response Power (SRP) with Phase Transform (PHAT) or SRP-PH...

全面介紹

Saved in:
書目詳細資料
主要作者: Lim, Scott Nathaniel
其他作者: Andy Khong Wai Hoong
格式: Final Year Project
語言:English
出版: 2018
主題:
在線閱讀:http://hdl.handle.net/10356/74906
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
機構: Nanyang Technological University
語言: English
實物特徵
總結:An acoustic and video processing demonstration system was developed. Direction of Arrival (DOA) estimation or sound source locations were overlaid on a video stream of the application, in real time. The application will implement the Steered Response Power (SRP) with Phase Transform (PHAT) or SRP-PHAT algorithm to generate power values from audio data of 40 channels. In general, the global maximum of these power values relate to the DOA. This technique was used due to its compatibility with Generalised Cross Correlation (GCC) and resilience to background noise. However, searching the power values from SRP-PHAT poses issues of long processing time and tendencies to peak at undesired local maxima. Thus, the Stochastic Region Contraction (SRC) method was employed for its performance improvements and effectiveness in searching for the power values’ global maxima. The algorithm was adapted from a MATLAB source, but due to software restrictions of the 40 channel microphone array, a conversion of MATLAB code to C++ was required. This code was converted manually. It was found that the converted code was not as reliable in providing accurate DOA results. Comparing Cosine Similarity Indexes (CSI) showed that the converted code produced a weak CSI of 57% similarity. Whereas, the MATLAB code had a high CSI of 87% similarity to a true DOA output. The low CSI percentage of the converted code in determining true DOAs, had justified the presence of conversion error. Hence, it was recommended that an automatic converter be emplaced, to avoid conversion errors.