首页> 外国专利> AUDIO FILE LABELING PROCESS FOR BUILDING DATASETS AT SCALE

AUDIO FILE LABELING PROCESS FOR BUILDING DATASETS AT SCALE

机译：大规模构建数据集的音频文件标记过程

页面导航

摘要
著录项
相似文献

摘要

An audio labeling tool is described for rapidly and efficiently building a labeled data set preferably comprising audio files with annotations denoting speaker transcriptions, speaker identity, periods of silence, background noise, and speaker emotion labels. The tool provides a configurable user interface (UI) with keyboard shortcuts and menu items for streamlined user-guided markup of audio files with context specific labeling and transcription notes. Human audio file labelers will preferably leverage the labeling tool and configurable user interface and menu to rapidly annotate, validate, and build labeled data sets, at scale, of time sliced audio files. The audio labeling tool is preferably applied to audio files and provides an automated means for adding notations, markup, text transcription, and feature labeling to an audio waveform, spectrogram, or other audio data visualization domain for amplifying and highlighting audio feature details. Labeled audio data is preferably used to train computer models, develop pattern recognition algorithms, and neural networks for automated processing, pre-populating and labeling of significantly large audio data sets at scale.

机译：描述了一种音频标记工具，用于快速和有效地构建标记的数据集，该标记的数据集优选地包括音频文件，该音频文件具有注释，该注释指示说话者转录，说话者身份，沉默期，背景噪声和说话者情感标签。该工具提供了带有键盘快捷键和菜单项的可配置用户界面（UI），可通过上下文特定的标签和转录注释简化用户指导的音频文件标记。人类音频文件标记者将优选地利用标记工具以及可配置的用户界面和菜单来按比例地快速注释，验证和构建按时间划分的音频文件的标记数据集。音频标记工具优选地应用于音频文件，并提供用于将标记，标记，文本转录和特征标记添加到音频波形，频谱图或其他音频数据可视化域的自动化手段，以放大和突出显示音频特征细节。标记的音频数据最好用于训练计算机模型，开发模式识别算法和神经网络，以自动处理，预先填充和标记大规模的大型音频数据集。

著录项

公开/公告号US2019362022A1

专利类型
公开/公告日2019-11-28

原文格式PDF
申请/专利权人 RISTO HAUKIOJA;LEON MAK;MARK RUIZ;ELI SAKOV;
展开▼

申请/专利号US201815989205
发明设计人 RISTO HAUKIOJA;LEON MAK;MARK RUIZ;ELI SAKOV;
展开▼

申请日2018-05-25
分类号G06F17/30;G10L15/04;G06F3/0482;
国家 US
入库时间 2022-08-21 11:19:21

相似文献

专利
外文文献
中文文献