EMOTION DETECTION IN AUDIO INTERACTIONS

Abstract

A method comprising: receiving a plurality of audio segments comprising a speech signal, wherein said audio segments represent a plurality of verbal interactions; receiving labels associated with an emotional state expressed in each of said audio segments; dividing each of said audio segments into a plurality of frames, based on a specified frame duration; extracting a plurality of acoustic features from each of said frames; computing statistics over said acoustic features with respect to sequences of frames representing phoneme boundaries in said audio segments; at a training stage, training a machine learning model on a training set comprising: said statistics associated with said audio segments, and said labels; and at an inference stage, applying said trained model to one or more target audio segments comprising a speech signal, to detect an emotional state expressed in said target audio segments.
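The steps in the abstract can be illustrated with a minimal sketch. Everything concrete below is an assumption made for illustration and does not come from the patent: the two toy features (RMS energy and zero-crossing rate), the 25 ms frame duration, the externally supplied frame-index boundaries standing in for phoneme boundaries, the synthetic tones used as audio, the label names, and the nearest-centroid classifier used in place of whatever machine learning model the patent actually trains.

```python
import math
from statistics import mean, pstdev

def split_into_frames(samples, sample_rate, frame_ms=25):
    """Divide a signal into non-overlapping frames of a specified duration."""
    frame_len = int(sample_rate * frame_ms / 1000)
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, frame_len)]

def frame_features(frame):
    """Two toy acoustic features per frame (assumed): RMS energy, zero-crossing rate."""
    rms = math.sqrt(sum(x * x for x in frame) / len(frame))
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0))
    return [rms, crossings / (len(frame) - 1)]

def segment_statistics(samples, sample_rate, boundaries, frame_ms=25):
    """Mean and std of each feature over each frame sequence, then averaged
    across sequences into one fixed-length vector per audio segment.
    `boundaries` is a hypothetical list of (start_frame, end_frame) pairs
    (end exclusive) standing in for phoneme boundaries."""
    frames = split_into_frames(samples, sample_rate, frame_ms)
    feats = [frame_features(f) for f in frames]
    per_sequence = []
    for start, end in boundaries:
        seq = feats[start:end]
        if not seq:
            continue
        cols = list(zip(*seq))
        per_sequence.append([mean(c) for c in cols] + [pstdev(c) for c in cols])
    return [mean(col) for col in zip(*per_sequence)]

def train(stat_vectors, labels):
    """Training stage: a nearest-centroid model (assumed), one centroid per label."""
    centroids = {}
    for label in set(labels):
        rows = [v for v, l in zip(stat_vectors, labels) if l == label]
        centroids[label] = [mean(col) for col in zip(*rows)]
    return centroids

def predict(centroids, vector):
    """Inference stage: return the label whose centroid is closest."""
    return min(centroids, key=lambda label: math.dist(centroids[label], vector))

def tone(freq, amp, sample_rate=8000, seconds=0.2):
    """Synthetic stand-in for a speech segment (a pure sine tone)."""
    n = int(sample_rate * seconds)
    return [amp * math.sin(2 * math.pi * freq * t / sample_rate) for t in range(n)]

SAMPLE_RATE = 8000
BOUNDARIES = [(0, 4), (4, 8)]  # hypothetical boundaries, in frame indices

training_audio = [tone(200, 0.2), tone(220, 0.25), tone(800, 0.9), tone(760, 0.8)]
training_labels = ["neutral", "neutral", "agitated", "agitated"]
training_stats = [segment_statistics(a, SAMPLE_RATE, BOUNDARIES) for a in training_audio]

model = train(training_stats, training_labels)
target_stats = segment_statistics(tone(780, 0.85), SAMPLE_RATE, BOUNDARIES)
detected = predict(model, target_stats)
```

In a real system the boundaries would come from a phoneme aligner or recognizer and the features would be richer (for example, pitch or spectral coefficients). The sketch shows only the flow the abstract describes: frames, per-frame features, statistics over boundary-delimited frame sequences, training on those statistics with labels, and applying the trained model to target audio.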
