Evaluating Automatic Speech Recognition Quality and Its Impact on Counselor Utterance Coding

机译：自动语音识别质量评估及其对辅导员话语编码的影响

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Automatic speech recognition (ASR) is a crucial step in many natural language processing (NLP) applications, as often available data consists mainly of raw speech. Since the result of the ASR step is considered as a meaningful, informative input to later steps in the NLP pipeline, it is important to understand the behavior and failure mode of this step. In this work, we analyze the quality of ASR in the psychotherapy domain, using motivational interviewing conversations between therapists and clients. We conduct domain agnostic and domain-relevant evaluations using evaluation metrics and also identify domain-relevant keywords in the ASR output. Moreover, we empirically study the effect of mixing ASR and manual data during the training of a downstream NLP model, and also demonstrate how additional local context can help alleviate the error introduced by noisy ASR transcripts.

机译：自动语音识别（ASR）是许多自然语言处理（NLP）应用中的关键步骤，因为通常可用的数据主要由原始语音组成。由于ASR步骤的结果被视为NLP管道中后续步骤的有意义的信息输入，因此了解该步骤的行为和故障模式非常重要。在这项工作中，我们使用治疗师和客户之间的动机式访谈对话，分析心理治疗领域ASR的质量。我们使用评估指标进行领域无关和领域相关评估，并在ASR输出中识别领域相关关键字。此外，我们还实证研究了在下游NLP模型的训练过程中混合ASR和人工数据的效果，并展示了额外的局部环境如何有助于缓解由嘈杂的ASR转录本引入的错误。

著录项

来源
《Workshop on Computational Linguistics and Clinical Psychology》|2021年|159-168|共10页
会议地点
作者
Do June Min; Veronica Perez-Rosas; Rada Mihalcea;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Impact of Languages and Accent on Perceived Speech Quality Predicted by Perceptual Evaluation of Speech Quality (PESQ) and Perceptual Objective Listening Quality Assessment (POLQA): Case of Moore, Dioula, French and English [J] . Daouda Konane, Sibiri Tiemounou, Wend Yam Serge Boris Ouedraogo 应用科学（英文） . 2021,第012期
2. Investigation of Automatic Speech Recognition Systems via the Multilingual Deep Neural Network Modeling Methods for a Very Low-Resource Language, Chaha [J] . Tessfu Geteye Fantaye, Junqing Yu, Tulu Tilahun Hailu 信号与信息处理（英文） . 2020,第001期
3. Agglutinative Language Speech Recognition Using Automatic Allophone Deriving [J] . XU Ji, PAN Jielin, YAN Yonghong 电子学报（英文版） . 2016,第002期
4. Agglutinative Language Speech Recognition Using Automatic Allophone Deriving [J] . XU Ji, PAN Jielin, YAN Yonghong 电子学报：英文版 . 2016,第002期
5. Attention and Feature Selection for Automatic Speech Emotion Recognition Using Utterance and Syllable-Level Prosodic Features [J] . Ben Alex Starlet, Mary Leena, Babu Ben P. Circuits, systems and signal processing . 2020,第11期

机译：用话语和音节级韵律特征对自动语音情感识别的关注和特征选择
6. Automatic Speech Recognition of English-isiZulu Code-switched Speech from South African Soap Operas [J] . Ewald van der Westhuizen, Thomas Niesler Procedia Computer Science . 2016,第22期

机译：南非肥皂剧中英语-西祖鲁语代码转换语音的自动语音识别
7. Single-Ended Speech Quality Prediction Based on Automatic Speech Recognition [J] . RAINER HUBER, JASPER OOSTER, BERND T. MEYER Journal of the Audio Engineering Society . 2018,第10期

机译：基于语音自动识别的单端语音质量预测
8. Impact of a Newly Developed Modern Standard Arabic Speech Corpus on Implementing and Evaluating Automatic Continuous Speech Recognition Systems [C] . Mohammad A.M. Abushariah, Raja N. Ainon, Roziati Zainuddin, Spoken dialogue systems for ambient environments . 2010

机译：新开发的现代标准阿拉伯语语音语料库对实施和评估自动连续语音识别系统的影响
9. Code breaking for automatic speech recognition. [D] . Venkataramani, Veera. 2005

机译：用于自动语音识别的密码破解。
10. Automatic speech recognition in the operating room – An essential contemporary tool or a redundant gadget? A survey evaluation among physicians in form of a qualitative study [O] . Antonia Schulte, Rodrigo Suarez-Ibarrola, Daniel Wegen, 2020

机译：手术室自动语音识别 - 必不可少的当代工具或冗余小工具？质量研究形式的医生调查评估
11. Automatic speech recognition of Cantonese-English code-mixing utterances. [O] . 2005

机译：automatic speech recognition of Cantonese-English code-mixing utterances.
12. Objective Speech Quality Evaluation of Real-Time Speech Coders [R] . Viswanathan, V. R., Russell, W. H., Huggins, A. W. F. 1984

机译：实时语音编码器的客观语音质量评估

Evaluating Automatic Speech Recognition Quality and Its Impact on Counselor Utterance Coding

摘要

著录项

相似文献

相关主题

期刊订阅