首页>
外国专利>
Parallel processing framework for voice to text digital media
Parallel processing framework for voice to text digital media
展开▼
机译:并行处理框架用于文本数字媒体的语音
展开▼
页面导航
摘要
著录项
相似文献
摘要
A method of converting speech to text comprises receiving an audio recording from an input device comprising speech of a plurality of speakers. Extracting from the audio recording, a speaker audio recording comprising recorded audio of an individual speaker. Selecting, based on a characteristic of the speaker audio recording, a speech to text engine and a dictionary. Configuring the speech to text engine with the dictionary and executing a first conversion process to convert a first portion of the speaker audio recording to produce a first transcript. Evaluating a performance metric of the conversion process against a quality metric to reconfigure the speech to text engine and execute a second conversion process to convert a second portion of the speaker audio recording to produce a second transcript. Combining the first transcript and the second transcript to produce a transcript of the speaker audio recording.
展开▼