Bimodal Audiovisual Perception in Interactive Application Systems of Moderate Complexity

Abstract

This dissertation deals with aspects of quality perception in interactive audiovisual application systems of moderate complexity, e.g. as defined in the MPEG-4 standard. Because the available computing power in these systems is limited, it is crucial to know which factors influence the perceived quality. Only then can the available computing power be distributed in the most effective and efficient way for the simulation and display of audiovisual 3D scenes. Whereas quality factors for unimodal auditory and visual stimuli are well known, and respective models of perception have been successfully devised based on this knowledge, this is not true for bimodal audiovisual perception. For the latter, it is only known that some kind of interdependency between auditory and visual perception exists. The exact mechanisms of human audiovisual perception have not been described. It is assumed that interaction with an application or scene has a major influence on the perceived overall quality.

The goal of this work was to devise a system capable of performing subjective audiovisual assessments in the given context in a largely automated way. By applying the system, first evidence regarding audiovisual interdependency and the influence of interaction on perception was to be collected. The work therefore comprised three fields of activity: the creation of a test bench based on the available but (regarding audio functionality) somewhat restricted MPEG-4 player; the study of methods and framework requirements that ensure comparability and reproducibility of audiovisual assessments and results; and the performance of a series of coordinated experiments, including the analysis and interpretation of the collected data.

An object-based modular audio rendering engine was co-designed and co-implemented, which allows simple room-acoustic simulations based on the MPEG-4 scene description paradigm to be performed in real time. Apart from the MPEG-4 player, the test bench consists of a haptic input device used by test subjects to enter their quality ratings and a logging tool that records all relevant events during an assessment session. The collected data can be exported conveniently for further analysis using appropriate statistical tools.

A thorough analysis of the well-established test methods and recommendations for unimodal subjective assessments was performed to find out whether a transfer to the audiovisual bimodal case is easily possible. It became evident that, due to the limited knowledge about the underlying perceptual processes, a novel categorization of experiments according to their goals could help to organize research in the field. Furthermore, a number of influencing factors were identified that exert control over bimodal perception in the given context.

By performing the perceptual experiments using the devised system, its functionality and ease of use were verified. Apart from that, some first indications of the role of interaction in perceived overall quality were collected: interaction in the auditory modality reduces a person's ability to rate the audio quality correctly, whereas visually based (cross-modal) interaction does not necessarily produce this effect.

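Bibliographic details

  • Author

    Reiter, Ulrich

  • Author affiliation
  • Year: 2009
  • Total pages
  • Original format: PDF
  • Language of text: English
  • Chinese Library Classification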
