首页> 外文期刊>Computer speech and language >Detecting emotional state of a child in a conversational computer game
【24h】

Detecting emotional state of a child in a conversational computer game

机译:在对话式计算机游戏中检测孩子的情绪状态

获取原文
获取原文并翻译 | 示例

摘要

The automatic recognition of user's communicative style within a spoken dialog system framework, including the affective aspects, has received increased attention in the past few years. For dialog systems, it is important to know not only what was said but also how something was communicated, so that the system can engage the user in a richer and more natural interaction. This paper addresses the problem of automatically detecting "frustration", "politeness", and "neutral" attitudes from a child's speech communication cues, elicited in spontaneous dialog interactions with computer characters. Several information sources such as acoustic, lexical, and contextual features, as well as, their combinations are used for this purpose. The study is based on a Wizard-of-Oz dialog corpus of 103 children, 7-14 years of age, playing a voice activated computer game. Three-way classification experiments, as well as, pairwise classification between polite vs. others and frustrated vs. others were performed. Experimental results show that lexical information has more discriminative power than acoustic and contextual cues for detection of politeness, whereas context and acoustic features perform best for frustration detection. Furthermore, the fusion of acoustic, lexical and contextual information provided significantly better classification results. Results also showed that classification performance varies with age and gender. Specifically, for the "politeness" detection task, higher classification accuracy was achieved for females and 10-11 years-olds, compared to males and other age groups, respectively.
机译:在过去的几年中,语音对话系统框架(包括情感方面)中用户的交流风格的自动识别受到了越来越多的关注。对于对话系统,重要的是不仅要知道说了什么,而且还必须知道如何进行通信,以便系统可以使用户参与到更丰富,更自然的交互中。本文讨论了自动检测对话中与计算机角色互动而引起的儿童语音交流提示中的“沮丧”,“礼貌”和“中立”态度的问题。为此,使用了一些信息源,例如声音,词汇和上下文特征及其组合。这项研究基于103个年龄在7-14岁的儿童的绿野仙踪对话语料库,他们玩声控计算机游戏。进行了三向分类实验,以及礼貌对他人与沮丧对他人之间的成对分类。实验结果表明,词汇信息比礼貌用语的听觉和上下文提示具有更大的判别力,而语境和听觉特征在挫折感方面表现最好。此外,声音,词汇和上下文信息的融合提供了明显更好的分类结果。结果还显示分类表现随年龄和性别而变化。具体而言,对于“礼貌”检测任务,与男性和其他年龄组相比,女性和10-11岁分别获得了更高的分类准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号