EMOTION DETECTION IN AUDIO INTERACTIONS

Abstract

A method comprising: receiving a plurality of audio segments comprising a speech signal, wherein said audio segments represent a plurality of verbal interactions; receiving labels associated with an emotional state expressed in each of said audio segments; dividing each of said audio segments into a plurality of frames, based on a specified frame duration; extracting a plurality of acoustic features from each of said frames; computing statistics over said acoustic features with respect to sequences of frames representing phoneme boundaries in said audio segments; at a training stage, training a machine learning model on a training set comprising: said statistics associated with said audio segments, and said labels; and at an inference stage, applying said trained model to one or more target audio segments comprising a speech signal, to detect an emotional state expressed in said target audio segments.
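
The steps in the abstract (framing, per-frame acoustic features, statistics over phoneme-aligned frame sequences, training, inference) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the patent: the two features (log energy and zero-crossing rate), the mean and standard-deviation statistics, the nearest-centroid classifier standing in for the machine learning model, the synthetic sine-wave "audio", the label names, and the hard-coded phoneme spans (which in practice might come from a forced aligner or speech recognizer).

```python
import math
from statistics import mean, pstdev


def split_frames(signal, frame_len):
    """Divide a signal into consecutive, non-overlapping frames of frame_len samples."""
    return [signal[i:i + frame_len]
            for i in range(0, len(signal) - frame_len + 1, frame_len)]


def frame_features(frame):
    """Two illustrative acoustic features per frame: log energy and zero-crossing rate."""
    energy = sum(x * x for x in frame) / len(frame)
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0))
    return [math.log(energy + 1e-10), crossings / (len(frame) - 1)]


def segment_statistics(signal, frame_len, phoneme_spans):
    """Mean and std of each feature within each phoneme span, averaged over spans.

    phoneme_spans is a list of (start_frame, end_frame) pairs, end exclusive.
    The spans are assumed to be supplied externally (hypothetical input).
    """
    feats = [frame_features(f) for f in split_frames(signal, frame_len)]
    per_span = []
    for start, end in phoneme_spans:
        columns = list(zip(*feats[start:end]))
        per_span.append([mean(c) for c in columns] + [pstdev(c) for c in columns])
    return [mean(c) for c in zip(*per_span)]


class NearestCentroid:
    """Tiny stand-in classifier: one centroid per label, predict the closest."""

    def fit(self, X, y):
        self.centroids = {}
        for label in set(y):
            rows = [x for x, lab in zip(X, y) if lab == label]
            self.centroids[label] = [mean(c) for c in zip(*rows)]
        return self

    def predict(self, x):
        return min(self.centroids, key=lambda lab: math.dist(x, self.centroids[lab]))


def tone(freq, amp, n=1600, rate=8000):
    """Synthetic stand-in for a speech segment: a sine wave of n samples."""
    return [amp * math.sin(2 * math.pi * freq * t / rate) for t in range(n)]


FRAME_LEN = 200            # 25 ms at 8 kHz (assumed frame duration)
SPANS = [(0, 4), (4, 8)]   # two hypothetical phoneme spans, 4 frames each

# Training stage: statistics per labeled segment, then fit the model.
train = [(tone(150, 0.20), "neutral"), (tone(180, 0.25), "neutral"),
         (tone(400, 0.90), "agitated"), (tone(450, 0.80), "agitated")]
X = [segment_statistics(sig, FRAME_LEN, SPANS) for sig, _ in train]
y = [label for _, label in train]
model = NearestCentroid().fit(X, y)

# Inference stage: apply the trained model to a target segment.
prediction = model.predict(segment_statistics(tone(420, 0.85), FRAME_LEN, SPANS))
print(prediction)
```

A real system would likely use richer features (for example MFCCs or pitch) and a stronger classifier; the sketch only shows how the stages connect.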

Bibliographic data

Publication number: WO2021127615A1
Patent type:
Publication date: 2021-06-24
Original format: PDF
Applicant/Assignee: GREENEDEN U.S. HOLDINGS II LLC
Application number: WO2020US66312
Inventors: FAIZAKOF AVRAHAM; HAIKIN LEV; KONIG YOCHAI; MAZZA ARNON
Filing date: 2020-12-21
Classification: G10L25/63; G10L15/04; G10L15/02; G10L25/30
Country: US
Database entry time: 2022-08-24 19:36:01