CrossASR++: A modular differential testing framework for automatic speech recognition
Developers need to perform adequate testing to ensure the quality of Automatic Speech Recognition (ASR) systems. However, manually collecting required test cases is tedious and time-consuming. Our recent work proposes CrossASR, a differential testing method for ASR systems. This method first utilize...
Saved in:
Main Authors: | , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2021
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/6864 https://ink.library.smu.edu.sg/context/sis_research/article/7867/viewcontent/CrossASR_a_modular_differential_testing_framework_for_automatic_speech_recognition.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-7867 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-78672022-02-07T11:14:13Z CrossASR++: A modular differential testing framework for automatic speech recognition ASYROFI, Muhammad Hilmi YANG, Zhou LO, David Developers need to perform adequate testing to ensure the quality of Automatic Speech Recognition (ASR) systems. However, manually collecting required test cases is tedious and time-consuming. Our recent work proposes CrossASR, a differential testing method for ASR systems. This method first utilizes Text-to-Speech (TTS) to generate audios from texts automatically and then feed these audios into different ASR systems for cross-referencing to uncover failed test cases. It also leverages a failure estimator to find failing test cases more efficiently. Such a method is inherently self-improvable: the performance can increase by leveraging more advanced TTS and ASR systems. So, in this accompanying tool demo paper, we further engineer CrossASR and propose CrossASR++, an easy-to-use ASR testing tool that can be conveniently extended to incorporate different TTS and ASR systems, and failure estimators. We also make CrossASR++ chunk texts from a given corpus dynamically and enable the estimator to work in a more effective and flexible way. We demonstrate that the new features can help CrossASR++ discover more failed test cases. Using the same TTS and ASR systems, CrossASR++ can uncover 26.2% more failed test cases for 4 ASRs than the original tool. Moreover, by simply adding one more ASR for cross-referencing, we can increase the number of failed test cases uncovered for each of the 4 ASR systems by 25.07%, 39.63%, 20.95% and 8.17% respectively. We also extend CrossASR++ with 5 additional failure estimators. Compared to worst estimator, the best one can discover 10.41% more failed test cases within the same amount of time. The demo video for CrossASR++ can be viewed at https://youtu.be/ddRk-f0QV-g and the source code can be found at https://github.com/soarsmu/CrossASRplus. 2021-08-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6864 info:doi/10.1145/3468264.3473124 https://ink.library.smu.edu.sg/context/sis_research/article/7867/viewcontent/CrossASR_a_modular_differential_testing_framework_for_automatic_speech_recognition.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University automatic speech recognition cross-referencing test case generation text-to-speech Artificial Intelligence and Robotics Databases and Information Systems |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
automatic speech recognition cross-referencing test case generation text-to-speech Artificial Intelligence and Robotics Databases and Information Systems |
spellingShingle |
automatic speech recognition cross-referencing test case generation text-to-speech Artificial Intelligence and Robotics Databases and Information Systems ASYROFI, Muhammad Hilmi YANG, Zhou LO, David CrossASR++: A modular differential testing framework for automatic speech recognition |
description |
Developers need to perform adequate testing to ensure the quality of Automatic Speech Recognition (ASR) systems. However, manually collecting required test cases is tedious and time-consuming. Our recent work proposes CrossASR, a differential testing method for ASR systems. This method first utilizes Text-to-Speech (TTS) to generate audios from texts automatically and then feed these audios into different ASR systems for cross-referencing to uncover failed test cases. It also leverages a failure estimator to find failing test cases more efficiently. Such a method is inherently self-improvable: the performance can increase by leveraging more advanced TTS and ASR systems. So, in this accompanying tool demo paper, we further engineer CrossASR and propose CrossASR++, an easy-to-use ASR testing tool that can be conveniently extended to incorporate different TTS and ASR systems, and failure estimators. We also make CrossASR++ chunk texts from a given corpus dynamically and enable the estimator to work in a more effective and flexible way. We demonstrate that the new features can help CrossASR++ discover more failed test cases. Using the same TTS and ASR systems, CrossASR++ can uncover 26.2% more failed test cases for 4 ASRs than the original tool. Moreover, by simply adding one more ASR for cross-referencing, we can increase the number of failed test cases uncovered for each of the 4 ASR systems by 25.07%, 39.63%, 20.95% and 8.17% respectively. We also extend CrossASR++ with 5 additional failure estimators. Compared to worst estimator, the best one can discover 10.41% more failed test cases within the same amount of time. The demo video for CrossASR++ can be viewed at https://youtu.be/ddRk-f0QV-g and the source code can be found at https://github.com/soarsmu/CrossASRplus. |
format |
text |
author |
ASYROFI, Muhammad Hilmi YANG, Zhou LO, David |
author_facet |
ASYROFI, Muhammad Hilmi YANG, Zhou LO, David |
author_sort |
ASYROFI, Muhammad Hilmi |
title |
CrossASR++: A modular differential testing framework for automatic speech recognition |
title_short |
CrossASR++: A modular differential testing framework for automatic speech recognition |
title_full |
CrossASR++: A modular differential testing framework for automatic speech recognition |
title_fullStr |
CrossASR++: A modular differential testing framework for automatic speech recognition |
title_full_unstemmed |
CrossASR++: A modular differential testing framework for automatic speech recognition |
title_sort |
crossasr++: a modular differential testing framework for automatic speech recognition |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2021 |
url |
https://ink.library.smu.edu.sg/sis_research/6864 https://ink.library.smu.edu.sg/context/sis_research/article/7867/viewcontent/CrossASR_a_modular_differential_testing_framework_for_automatic_speech_recognition.pdf |
_version_ |
1770576109265485824 |