首页> 外国专利> Parallel processing framework for voice to text digital media

Parallel processing framework for voice to text digital media

机译：并行处理框架用于文本数字媒体的语音

页面导航

摘要
著录项
相似文献

摘要

A method of converting speech to text comprises receiving an audio recording from an input device comprising speech of a plurality of speakers. Extracting from the audio recording, a speaker audio recording comprising recorded audio of an individual speaker. Selecting, based on a characteristic of the speaker audio recording, a speech to text engine and a dictionary. Configuring the speech to text engine with the dictionary and executing a first conversion process to convert a first portion of the speaker audio recording to produce a first transcript. Evaluating a performance metric of the conversion process against a quality metric to reconfigure the speech to text engine and execute a second conversion process to convert a second portion of the speaker audio recording to produce a second transcript. Combining the first transcript and the second transcript to produce a transcript of the speaker audio recording.

机译：将语音转换为文本的方法包括从包括多个扬声器的语音接收来自输入设备的音频记录。从音频记录中提取，扬声器音频记录包括单独扬声器的记录音频。根据扬声器音频录制的特征选择，发表到文本引擎和字典。将语音与字典配置为文本引擎，并执行第一转换过程以转换扬声器音频记录的第一部分以产生第一兆字谜。评估转换过程的转换过程的性能度量，以将语音重新配置到文本引擎，并执行第二转换过程以转换扬声器音频记录的第二部分以产生第二副本。组合第一转录物和第二转录物以产生扬声器音频记录的转录器。

著录项

公开/公告号US11152005B2

专利类型
公开/公告日2021-10-19

原文格式PDF
申请/专利权人 VIQ SOLUTIONS INC.;
展开▼

申请/专利号US201916567143
发明设计人 MALCOLM MACALLUM;
展开▼

申请日2019-09-11
分类号G10L17;G10L25/63;
国家 US
入库时间 2022-08-24 21:44:50

相似文献

专利
外文文献
中文文献