ASDF: A Differential testing framework for automatic speech recognition systems

Recent years have witnessed wider adoption of Automated Speech Recognition (ASR) techniques in various domains. Consequently, evaluating and enhancing the quality of ASR systems is of great importance. This paper proposes Asdf, an Automated Speech Recognition Differential Testing Framework to test A...

Full description

Saved in:
Bibliographic Details
Main Authors: YUEN, Daniel Hao Xian, PANG, Andrew Yong Chen, YANG, Zhou, CHONG, Chun Yong, LIM, Mei Kuan, LO, David
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2023
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/8565
https://ink.library.smu.edu.sg/context/sis_research/article/9568/viewcontent/ASDF__A_Differential_Testing_Framework_for_Automatic_Speech_Recognition_Systems.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-9568
record_format dspace
spelling sg-smu-ink.sis_research-95682024-01-25T09:01:48Z ASDF: A Differential testing framework for automatic speech recognition systems YUEN, Daniel Hao Xian PANG, Andrew Yong Chen YANG, Zhou CHONG, Chun Yong LIM, Mei Kuan LO, David Recent years have witnessed wider adoption of Automated Speech Recognition (ASR) techniques in various domains. Consequently, evaluating and enhancing the quality of ASR systems is of great importance. This paper proposes Asdf, an Automated Speech Recognition Differential Testing Framework to test ASR systems. Asdf extends an existing ASR testing tool, the CrossASR++, which synthesizes test cases from a text corpus. However, CrossASR++ fails to make use of the text corpus efficiently and provides limited information on how the failed test cases can improve ASR systems. To address these limitations, our tool incorporates two novel features: (1) a text transformation module to boost the number of generated test cases and uncover more errors in ASR systems, and (2) a phonetic analysis module to identify phonemes that the ASR systems tend to transcribe incorrectly. Asdf generates more high-quality test cases by applying various text transformation methods (e.g., changing tense) to the input text in a failed test case. By doing so, Asdf can utilize a small text corpus to generate a large number of audio test cases, something which CrossASR++ is not capable of. In addition, Asdf implements more metrics to evaluate the performance of ASR systems from multiple perspectives. Asdf performs phonetic analysis on the identified failed test cases to identify the phonemes that ASR systems tend to transcribe incorrectly, providing useful information for developers to improve ASR systems. The demonstration video of our tool is made online at https://www.youtube.com/watch?v=DzVwfc3h9As. The implementation is available at https://github.com/danielyuenhx/asdf-differential-testing. 2023-04-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8565 info:doi/10.1109/ICST57152.2023.00050 https://ink.library.smu.edu.sg/context/sis_research/article/9568/viewcontent/ASDF__A_Differential_Testing_Framework_for_Automatic_Speech_Recognition_Systems.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Automated speech recognition Automatic speech recognition system Differential testing Limited information Phonetic analysis Speech recognition systems Test case Testing framework Testing tools Text corpora Databases and Information Systems
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Automated speech recognition
Automatic speech recognition system
Differential testing
Limited information
Phonetic analysis
Speech recognition systems
Test case
Testing framework
Testing tools
Text corpora
Databases and Information Systems
spellingShingle Automated speech recognition
Automatic speech recognition system
Differential testing
Limited information
Phonetic analysis
Speech recognition systems
Test case
Testing framework
Testing tools
Text corpora
Databases and Information Systems
YUEN, Daniel Hao Xian
PANG, Andrew Yong Chen
YANG, Zhou
CHONG, Chun Yong
LIM, Mei Kuan
LO, David
ASDF: A Differential testing framework for automatic speech recognition systems
description Recent years have witnessed wider adoption of Automated Speech Recognition (ASR) techniques in various domains. Consequently, evaluating and enhancing the quality of ASR systems is of great importance. This paper proposes Asdf, an Automated Speech Recognition Differential Testing Framework to test ASR systems. Asdf extends an existing ASR testing tool, the CrossASR++, which synthesizes test cases from a text corpus. However, CrossASR++ fails to make use of the text corpus efficiently and provides limited information on how the failed test cases can improve ASR systems. To address these limitations, our tool incorporates two novel features: (1) a text transformation module to boost the number of generated test cases and uncover more errors in ASR systems, and (2) a phonetic analysis module to identify phonemes that the ASR systems tend to transcribe incorrectly. Asdf generates more high-quality test cases by applying various text transformation methods (e.g., changing tense) to the input text in a failed test case. By doing so, Asdf can utilize a small text corpus to generate a large number of audio test cases, something which CrossASR++ is not capable of. In addition, Asdf implements more metrics to evaluate the performance of ASR systems from multiple perspectives. Asdf performs phonetic analysis on the identified failed test cases to identify the phonemes that ASR systems tend to transcribe incorrectly, providing useful information for developers to improve ASR systems. The demonstration video of our tool is made online at https://www.youtube.com/watch?v=DzVwfc3h9As. The implementation is available at https://github.com/danielyuenhx/asdf-differential-testing.
format text
author YUEN, Daniel Hao Xian
PANG, Andrew Yong Chen
YANG, Zhou
CHONG, Chun Yong
LIM, Mei Kuan
LO, David
author_facet YUEN, Daniel Hao Xian
PANG, Andrew Yong Chen
YANG, Zhou
CHONG, Chun Yong
LIM, Mei Kuan
LO, David
author_sort YUEN, Daniel Hao Xian
title ASDF: A Differential testing framework for automatic speech recognition systems
title_short ASDF: A Differential testing framework for automatic speech recognition systems
title_full ASDF: A Differential testing framework for automatic speech recognition systems
title_fullStr ASDF: A Differential testing framework for automatic speech recognition systems
title_full_unstemmed ASDF: A Differential testing framework for automatic speech recognition systems
title_sort asdf: a differential testing framework for automatic speech recognition systems
publisher Institutional Knowledge at Singapore Management University
publishDate 2023
url https://ink.library.smu.edu.sg/sis_research/8565
https://ink.library.smu.edu.sg/context/sis_research/article/9568/viewcontent/ASDF__A_Differential_Testing_Framework_for_Automatic_Speech_Recognition_Systems.pdf
_version_ 1789483277145341952