首页> 外文学位 >Patterns of evidence: Investigating information in visible speech perception.
【24h】

Patterns of evidence: Investigating information in visible speech perception.

机译:证据模式:调查可见语音感知中的信息。

获取原文
获取原文并翻译 | 示例

摘要

The goal of this investigation was to find the information in a speaker's face used in visible speech perception. Three hypotheses were also tested: (a) information is statistically independent and not just orthogonal; (b) independent features are more functional than orthogonal features, six visible features, or linguistic features; (c) multiplicative integration of the FLMP better describes information processing than additive integration (AMP) or non-integration (RACE).; At the outset, information was clearly defined and five components of information were identified: (a) Empirical; (b) Functional; (c) Contextual; (d) Independent; and (e) Minimal. These five components are criteria that must be satisfied to concluded that information has been uncovered in any inquiry. To meet all five criteria a General Framework of Perception (GFP) was proposed as a basis for a General Model of Perception (GMP). GMP meets all the criteria by linking models of sensory encoding, Independent Components Analysis (ICA), with models of information processing, Multidimensional Fuzzy Logical Model of Perception (MD-FLMP).; ICA was trained on images of a speaker saying 22 English consonants in three vowel contexts. The resulting features of ICA and several competing theories were then fit against subject responses using MD-FLMP, AMP, and RACE. Responses were obtained from an experiment in which subjects viewed the same speaker saying one of 264 monosyllabic English words without sound and identified the word that was spoken. Responses for the initial CV of each word were collected and organized in a consonant confusion matrix.; Results showed that information comes from areas of the face not typically explored in studies of visible speech; namely, chin movement independent of jaw rotation, neck flexation, and skin warping/wrinkling. Information was also found from several well-known facial regions including the lips, teeth, and jaw. Several lip configurations were functional including rounding, unrounding, spreading, compression, and protrusion. Model tests confirmed all three hypotheses supporting the functionality of the independent features and the multiplicative integration assumption of the MD-FLMP over additive integration and non-integration. Applications of this approach to human speech training and a visible speech recognition system were discussed.
机译:这项调查的目的是在可见的语音感知中找到说话者面部的信息。还检验了三个假设:(a)信息在统计上是独立的,而不仅仅是正交的; (b)独立特征比正交特征,六个可见特征或语言特征更具功能性; (c)FLMP的乘法集成比加法积分(AMP)或非积分(RACE)更好地描述了信息处理;首先,明确定义了信息,并确定了信息的五个组成部分:(a)实证; (b)职能; (c)情境; (d)独立; (e)最少。这五个组成部分是得出任何查询中已发现信息的结论所必须满足的标准。为了满足所有五个标准,提出了通用感知框架(GFP)作为通用感知模型(GMP)的基础。 GMP通过将感官编码模型,独立成分分析(ICA)与信息处理模型,多维模糊逻辑感知模型(MD-FLMP)链接起来,从而满足所有标准。 ICA接受了在三个元音环境中讲22个英语辅音的说话者图像的培训。然后,使用MD-FLMP,AMP和RACE将ICA的最终功能和几种竞争性理论与受试者的反应相吻合。回答是从一个实验中获得的,在该实验中,受试者观看同一说话者说出264个单音节英语单词中的一个而没有声音,并识别出该单词。收集每个单词的初始简历的响应,并将其组织在辅音混淆矩阵中。结果表明,信息来自面部区域,而这些区域通常是在可见语音研究中未曾探索过的。即下巴运动与下颌旋转,颈部屈曲和皮肤翘曲/起皱无关。还从几个著名的面部区域(包括嘴唇,牙齿和下巴)中找到了信息。几种唇形具有功能,包括倒圆,不倒圆,散布,压缩和突出。模型测试证实了所有三个假设,这些假设支持独立特征的功能以及MD-FLMP在累加积分和非积分上的乘法积分假设。讨论了这种方法在人类语音训练和可见语音识别系统中的应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号