首页> 外国专利> FILLER WORD DETECTION THROUGH TOKENIZING AND LABELING OF TRANSCRIPTS

FILLER WORD DETECTION THROUGH TOKENIZING AND LABELING OF TRANSCRIPTS

机译：通过令威胁和标记转录物的填充词检测

页面导航

摘要
著录项
相似文献

摘要

Introduced here are computer programs and associated computer-implemented techniques for discovering the presence of filler words through tokenization of a transcript derived from audio content. When audio content is obtained by a media production platform, the audio content can be converted into text content as part of a speech-to-text operation. The text content can then be tokenized and labeled using a Natural Language Processing (NLP) library. Tokenizing/labeling may be performed in accordance with a series of rules associated with filler words. At a high level, these rules may examine the text content (and associated tokens/labels) to determine whether patterns, relationships, verbatim, and context indicate that a term is a filler word. Any filler words that are discovered in the text content can be identified as such so that appropriate action(s) can be taken.

机译：这里介绍的是计算机程序和相关的计算机实现的技术，用于通过授予来自音频内容的成绩单来发现填充词的存在。当媒体生产平台获得音频内容时，音频内容可以作为语音到文本操作的一部分转换为文本内容。然后可以使用自然语言处理（NLP）库来刻录和标记文本内容。可以根据与填充词相关联的一系列规则来执行令牌化/标记。在高级别中，这些规则可以检查文本内容（以及关联的令牌/标签）以确定模式，关系，逐字和上下文是否表明术语是填充字。可以识别在文本内容中发现的任何填充单词，这样可以拍摄适当的动作。

著录项

公开/公告号US2022036004A1

专利类型
公开/公告日2022-02-03

原文格式PDF
申请/专利权人 DESCRIPT INC.;
展开▼

申请/专利号US202017094533
发明设计人 ALEXANDRE DE BRÉBISSON;ANTOINE DANDIGNÉ;
展开▼

申请日2020-11-10
分类号G06F40/284;G06F40/205;G10L15/26;
国家 US
入库时间 2022-08-24 23:36:25

相似文献

专利
外文文献
中文文献