Multilevel Integration of Vision and Speech Understanding Using Bayesian Networks

Abstract

The interaction of image and speech processing is a crucial property of multimedia systems. Classical systems using inferences on pure qualitative high level descriptions miss a lot of information when concerned with erroneous, vague, or incomplete data. We propose a new architecture that integrates various levels of processing by using multiple representations of the visually observed scene. They are vertically connected by Bayesian networks in order to find the most plausible interpretation of the scene.

The interpretation of a spoken utterance naming an object in the visually observed scene is modeled as another partial representation of the scene. Using this concept, the key problem is the identification of the verbally specified object instances in the visually observed scene. Therefore, a Bayesian network is generated dynamically from the spoken utterance and the visual scene representation. In this network spatial knowledge as well as knowledge extracted from psycholinguistic experiments is coded. First results show the robustness of our approach.
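
The identification step described in the abstract can be illustrated with a very small sketch. This is not the authors' network: it assumes a single hidden variable (which scene object the speaker means), a uniform prior, and two conditionally independent observations (the uttered type word and the uttered color word). The function name `identify_object`, the confusion tables `p_type` and `p_color`, and every probability value are hypothetical, chosen only to show how uncertain speech and vision evidence might be combined.

```python
# Hypothetical sketch: names and probabilities are illustrative assumptions,
# not structures or values taken from the paper.

def identify_object(scene, utterance, p_type, p_color, floor=0.01):
    """Return a posterior over scene objects for the intended referent.

    scene:     list of dicts with the visually detected 'type' and 'color'
    utterance: dict with the recognized 'type' and 'color' words
    p_type:    p_type[(said, seen)] = P(speaker says `said` | object is `seen`)
    p_color:   same layout as p_type, for color words
    floor:     small probability used for word/object pairs not in the tables
    """
    prior = 1.0 / len(scene)  # uniform prior over candidate objects
    scores = []
    for obj in scene:
        likelihood = (
            p_type.get((utterance["type"], obj["type"]), floor)
            * p_color.get((utterance["color"], obj["color"]), floor)
        )
        scores.append(prior * likelihood)
    total = sum(scores)
    return [s / total for s in scores]  # normalize to a posterior


# The network is built per utterance from the current scene.
scene = [
    {"type": "cube", "color": "red"},
    {"type": "cube", "color": "blue"},
    {"type": "bolt", "color": "red"},
]
utterance = {"type": "cube", "color": "red"}

# Assumed confusion tables: naming is mostly, but not always, accurate.
p_type = {("cube", "cube"): 0.9, ("cube", "bolt"): 0.05}
p_color = {("red", "red"): 0.9, ("red", "blue"): 0.1}

posterior = identify_object(scene, utterance, p_type, p_color)
best = max(range(len(scene)), key=lambda i: posterior[i])
print(best, [round(p, 3) for p in posterior])
```

Because every candidate keeps a nonzero probability, a misrecognized word or a misclassified object lowers the score of the correct referent without ruling it out, which is the kind of robustness to erroneous or vague input that the abstract attributes to the probabilistic approach.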

Bibliographic record

Source
Computer vision systems, 1999, pp. 231-254 (24 pages)
Conference location: Las Palmas (ES)
Authors
Sven Wachsmuth; Hans Brandt-Pook; Gudrun Socher; Franz Kummert; Gerhard Sagerer
Author affiliations

University of Bielefeld, Technical Faculty, P.O. Box 100131, 33501 Bielefeld, Germany;

University of Bielefeld, Technical Faculty, P.O. Box 100131, 33501 Bielefeld, Germany;

Vidam Communications Inc., 2 N 1st St., San Jose, CA 95113;

University of Bielefeld, Technical Faculty, P.O. Box 100131, 33501 Bielefeld, Germany;

University of Bielefeld, Technical Faculty, P.O. Box 100131, 33501 Bielefeld, Germany;

Original format: PDF
Language: English
Chinese Library Classification: Electronic analog computers (continuous-action electronic computers)
