首页> 外国专利> Human-based accent detection to assist rapid transcription with automatic speech recognition

Human-based accent detection to assist rapid transcription with automatic speech recognition

机译:基于人的重音检测可通过自动语音识别帮助快速转录

摘要

Knowing what accent is spoken can assist automatic speech recondition (ASR) systems to more accurately transcribe audio. In one embodiment, a system includes a frontend server configured to transmit, to a backend server, an audio recording that includes speech of one or more people in a room over a period spanning at least two hours. At sonic time during the first hour of the period, the backend server provides a transcriber with a certain segment of the audio recording, and receives, from the transcriber, after the transcriber listened to a certain segment, an indication indicative of an accent of a person who spoke in the certain segment. The backend server then provides the indication to an ASR system to be utilized to generate a transcription of an additional portion of the audio recording, which was recorded after the first twenty minutes of the period.
机译:知道说什么口音可以帮助自动语音重新调整(ASR)系统更准确地转录音频。在一个实施例中,一种系统包括前端服务器,该前端服务器被配置为将音频记录传输到后端服务器,该音频记录包括在跨越至少两个小时的时间段内房间中一个或多个人的语音。在该时段的第一个小时的声音时间,后端服务器向转录员提供音频记录的特定片段,并在转录者收听特定片段后从转录者接收指示重音的指示。在特定细分中讲话的人。然后,后端服务器将指示提供给ASR系统,以用于生成音频记录的其他部分的转录,该转录是在该时段的前二十分钟之后记录的。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号