International Conference on Text, Speech and Dialogue

Are You Looking at Me, Are You Talking with Me: Multimodal Classification of the Focus of Attention



Abstract

Automatic dialogue systems are easily confused when they recognize speech that is not directed at the system. Besides noise or other people's conversations, even the user's own utterances can cause difficulties when the user is talking to someone else or to himself ("Off-Talk"). In this paper, the automatic classification of the user's focus of attention is investigated. In the German SmartWeb project, a mobile device is used to access the semantic web. In this scenario, two modalities are available - the speech and the video signal. This makes it possible to classify whether a spoken request is addressed to the system or not: with the camera of the mobile device, the user's gaze direction is detected; in the speech signal, prosodic features are analyzed. Encouraging recognition rates of up to 93% are achieved in the speech-only condition. Further improvement is expected from the fusion of the two information sources.
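The fusion of the two information sources mentioned above could, for instance, take the form of a late-fusion scheme that combines per-modality confidence scores. The following is a minimal sketch of such a scheme; the function names, weights, and threshold are illustrative assumptions, not the paper's actual method.

```python
# Hedged sketch: late fusion of gaze and prosody scores for On-Talk /
# Off-Talk classification. All names, weights, and the threshold are
# illustrative assumptions, not taken from the paper.

def fuse_scores(gaze_score: float, prosody_score: float,
                gaze_weight: float = 0.5) -> float:
    """Weighted average of per-modality On-Talk confidences in [0, 1]."""
    return gaze_weight * gaze_score + (1.0 - gaze_weight) * prosody_score


def classify_utterance(gaze_score: float, prosody_score: float,
                       threshold: float = 0.5) -> str:
    """Label an utterance as addressed to the system or as Off-Talk."""
    fused = fuse_scores(gaze_score, prosody_score)
    return "on-talk" if fused >= threshold else "off-talk"


if __name__ == "__main__":
    # User looking at the device and speaking in a read-speech style:
    print(classify_utterance(gaze_score=0.9, prosody_score=0.8))  # on-talk
    # User looking away and mumbling to himself:
    print(classify_utterance(gaze_score=0.1, prosody_score=0.2))  # off-talk
```

A weighted average is only one of many possible combination rules; a trained classifier over the concatenated modality features would be another common choice.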
