首页> 外文会议>Workshop on Computational Linguistics and Clinical Psychology >Evaluating Automatic Speech Recognition Quality and Its Impact on Counselor Utterance Coding
【24h】

Evaluating Automatic Speech Recognition Quality and Its Impact on Counselor Utterance Coding

机译:自动语音识别质量评估及其对辅导员话语编码的影响

获取原文

摘要

Automatic speech recognition (ASR) is a crucial step in many natural language processing (NLP) applications, as often available data consists mainly of raw speech. Since the result of the ASR step is considered as a meaningful, informative input to later steps in the NLP pipeline, it is important to understand the behavior and failure mode of this step. In this work, we analyze the quality of ASR in the psychotherapy domain, using motivational interviewing conversations between therapists and clients. We conduct domain agnostic and domain-relevant evaluations using evaluation metrics and also identify domain-relevant keywords in the ASR output. Moreover, we empirically study the effect of mixing ASR and manual data during the training of a downstream NLP model, and also demonstrate how additional local context can help alleviate the error introduced by noisy ASR transcripts.
机译:自动语音识别(ASR)是许多自然语言处理(NLP)应用中的关键步骤,因为通常可用的数据主要由原始语音组成。由于ASR步骤的结果被视为NLP管道中后续步骤的有意义的信息输入,因此了解该步骤的行为和故障模式非常重要。在这项工作中,我们使用治疗师和客户之间的动机式访谈对话,分析心理治疗领域ASR的质量。我们使用评估指标进行领域无关和领域相关评估,并在ASR输出中识别领域相关关键字。此外,我们还实证研究了在下游NLP模型的训练过程中混合ASR和人工数据的效果,并展示了额外的局部环境如何有助于缓解由嘈杂的ASR转录本引入的错误。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号