首页> 外文期刊>Interacting with Computers >Third-party error detection support mechanisms for dictation speech recognition
【24h】

Third-party error detection support mechanisms for dictation speech recognition

机译:用于听写语音识别的第三方错误检测支持机制

获取原文
获取原文并翻译 | 示例
           

摘要

Although speech recognition has improved significantly in recent years, its adoption continues to be limited, in part, by the effort and frustration associated with correcting speech recognition errors. Error detection is a particularly challenging issue in third-party error correction where different individuals are responsible for the original dictation and correcting the resulting text. This research aims to address the difficulty experienced in third-party error detection by developing and evaluating a variety of support mechanisms. Drawing on a growing body of literature on human computer interaction and speech recognition, four support mechanisms were designed and evaluated, namely indexed audio, speech summarization, error prediction, and the presentation of alternative hypotheses. A user study assessed the impact of these support mechanisms on both performance and perceptions during error detection tasks. Performance measures included effectiveness and efficiency, and perception measures included confidence, perceived usefulness, and cognitive workload. The results provide strong support for the use of indexed audio in the context of third-party error detection. The results also confirm that consecutive error rate, or the percentage of recognition errors immediately adjacent to another error, has a negative impact on the effectiveness of third-party error detection. Other support mechanisms failed to improve either effectiveness or perceptions, but they did negate the negative impact as consecutive error rate increased. These findings have significant implications for speech recognition error detection research and the design of error detection support solutions.
机译:尽管近年来语音识别已得到显着改善,但其采用仍然受到部分纠正与纠正语音识别错误相关的努力和挫败感的限制。错误检测是第三方错误纠正中一个特别具有挑战性的问题,在该错误中,不同的人负责原始命令并纠正结果文本。这项研究旨在通过开发和评估各种支持机制来解决第三方错误检测中遇到的困难。借助关于人机交互和语音识别的越来越多的文献,设计和评估了四种支持机制,即索引音频,语音摘要,错误预测和替代假设的表示。一项用户研究评估了错误检测任务期间这些支持机制对性能和感知的影响。绩效指标包括有效性和效率,感知指标包括信心,感知有用性和认知工作量。结果为在第三方错误检测的上下文中使用索引音频提供了有力的支持。结果还证实,连续的错误率或与另一个错误紧邻的识别错误的百分比对第三方错误检测的有效性具有负面影响。其他支持机制未能提高有效性或认知度,但是随着连续错误率的增加,它们确实消除了负面影响。这些发现对语音识别错误检测研究和错误检测支持解决方案的设计具有重要意义。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号