Crowd sourcing audio transcription via re-speaking

Abstract

Speech audio that is intended for transcription into textual form is received. The received speech audio is divided into first speech segments. A plurality of speakers is identified. A speaker is configured for repeating in spoken form a first speech segment that the speaker has listened to. A subset of speakers is determined for sending each first speech segment. Each first speech segment is sent to the subset of speakers determined for the particular first speech segment. The second speech segments are received from the speakers. The second speech segment is a re-spoken version of a first speech segment that has been generated by a speaker by repeating in spoken form the first speech segment. The second speech segments are processed to generate partial transcripts. The partial transcripts are combined to generate a complete transcript that is a textual representation corresponding to the received speech audio.
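The flow in the abstract (segment the audio, pick a subset of speakers per segment, collect the re-spoken versions, turn them into partial transcripts, combine) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented implementation: the fixed-length segmentation, the round-robin speaker assignment, and the majority vote over candidate transcripts are choices made here for brevity, and `respeak` and `recognize` are hypothetical callables standing in for the human re-speaking step and a speech recognizer.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence

Audio = Sequence[float]  # assumption: audio is a plain sequence of samples


@dataclass
class Segment:
    index: int   # position of the segment within the original recording
    audio: Audio


def split_audio(samples: Audio, segment_len: int) -> List[Segment]:
    """Divide the received audio into fixed-length 'first speech segments'."""
    starts = range(0, len(samples), segment_len)
    return [Segment(i, samples[s:s + segment_len]) for i, s in enumerate(starts)]


def assign_speakers(n_segments: int, speakers: Sequence[str],
                    per_segment: int) -> Dict[int, List[str]]:
    """Pick a subset of speakers for each segment (round-robin, an assumption)."""
    if not 0 < per_segment <= len(speakers):
        raise ValueError("per_segment must be between 1 and len(speakers)")
    n = len(speakers)
    return {
        i: [speakers[(i * per_segment + j) % n] for j in range(per_segment)]
        for i in range(n_segments)
    }


def transcribe(samples: Audio,
               speakers: Sequence[str],
               respeak: Callable[[str, Segment], Audio],   # hypothetical
               recognize: Callable[[Audio], str],          # hypothetical
               segment_len: int,
               per_segment: int) -> str:
    """Run the whole re-speaking pipeline and return the combined transcript."""
    segments = split_audio(samples, segment_len)
    plan = assign_speakers(len(segments), speakers, per_segment)
    partials = []
    for seg in segments:
        # Each assigned speaker re-speaks the segment ("second speech segment"),
        # which is then recognized into a candidate partial transcript.
        candidates = [recognize(respeak(spk, seg)) for spk in plan[seg.index]]
        # Assumption: keep the most frequent candidate for this segment.
        partials.append(Counter(candidates).most_common(1)[0][0])
    # Combine the partial transcripts in segment order.
    return " ".join(partials)
```

Sending each segment to more than one speaker lets the combining step tolerate an occasional poor re-speaking; the vote used here is just one simple way to pick among candidates, and the abstract does not say how the combination is actually done.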

Bibliographic data

Publication number: US9418660B2

Publication date: 2016-08-16

Original format: PDF
Applicant/assignee: CISCO TECHNOLOGY INC.

Application number: US201414156032
Inventors: MATTHIAS PAULIK; VIVEK HALDER; ANANTH SANKAR

Filing date: 2014-01-15
Classification: G10L15/26; G06Q10/06; G10L15/04; G10L25/87; G10L15/32; G10L15/07
Country: US
Database entry time: 2022-08-21 14:32:10
