International conference on multimodal interfaces and workshop on machine learning for multimodal interfaces 2009

Cache-based Language Model Adaptation using Visual Attention for ASR in Meeting Scenarios



Abstract

In a typical group meeting involving discussion and collaboration, people look at one another, at shared information resources such as presentation material, and sometimes at nothing in particular. In this work we investigate whether knowing what a person is looking at can improve the performance of Automatic Speech Recognition (ASR). We propose a framework for cache Language Model (LM) adaptation in which the cache is built from a person's Visual Attention (VA) sequence. The framework estimates the appropriateness of adaptation from characteristics of the VA sequence. Evaluation on the AMI Meeting Corpus shows reduced LM perplexity. This work demonstrates the potential of cache-based LM adaptation using VA information for large-vocabulary ASR deployed in meeting scenarios.
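The core idea of cache-based LM adaptation can be sketched as linear interpolation between a base LM probability and a unigram distribution estimated from a cache of recently attended words (e.g. text from a slide the speaker is looking at). The function names, the interpolation weight, and the smoothing floor below are illustrative assumptions, not details taken from the paper:

```python
from collections import Counter
import math

def cache_unigram(cache_words):
    """Unigram distribution over the cache (e.g. words from attended slides)."""
    counts = Counter(cache_words)
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()}

def adapted_prob(word, base_prob, cache_dist, lam):
    """Linearly interpolate the base LM probability with the cache unigram.

    lam is the (illustrative) cache weight in [0, 1].
    """
    return (1.0 - lam) * base_prob + lam * cache_dist.get(word, 0.0)

def perplexity(words, base_lm, cache_dist, lam):
    """Perplexity of a word sequence under the interpolated model.

    Unknown words get a small floor probability (1e-6, an assumption)
    so the log does not diverge.
    """
    log_prob = 0.0
    for w in words:
        p = adapted_prob(w, base_lm.get(w, 1e-6), cache_dist, lam)
        log_prob += math.log2(p)
    return 2.0 ** (-log_prob / len(words))
```

When the test words overlap with the cache, the interpolated model assigns them higher probability than the base LM alone, which is the mechanism behind the perplexity reduction reported in the abstract.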
