Venue: International Conference on Speech and Computer

Medical Speech Recognition: Reaching Parity with Humans

Abstract

We present a speech recognition system for the medical domain whose architecture is based on a state-of-the-art stack trained on over 270 h of medical speech data and 30 million tokens of text from clinical episodes. Despite the acoustic challenges and linguistic complexity of the domain, we were able to reduce the system's word error rate to below 16% in a realistic clinical use case. To further benchmark our system, we determined the human word error rate on a corpus covering a wide variety of speakers, working with multiple medical transcriptionists, and found that our speech recognition system performs on a par with humans.
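
Both the system and the human transcriptionists are compared by word error rate (WER). The abstract does not describe the scoring procedure, so the following is only a minimal sketch of the standard definition: word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The function name `word_error_rate`, the whitespace tokenization, and the example sentences are assumptions made for illustration and are not taken from the paper or its corpus.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Standard WER: word-level Levenshtein distance / reference length.

    Assumption: tokens are obtained by splitting on whitespace, with no
    text normalization. The paper's actual scoring setup is not stated.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# Made-up example: one substituted word out of five reference words.
example_wer = word_error_rate(
    "the patient denies chest pain",
    "the patient denies chess pain",
)
```

Under this definition, a human WER can be obtained the same way by scoring one transcriptionist's output against a reference transcript, which is what makes a direct system-versus-human comparison possible.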
