首页> 外文会议>International Speech Communication Association >A Language-Modeling Approach to Inverse Text Normalization and DataCleanup for Multimodal Voice Search Applications

【24h】

A Language-Modeling Approach to Inverse Text Normalization and DataCleanup for Multimodal Voice Search Applications

机译：用于多模式语音搜索应用的逆文本归一化和DataCleanup的语言建模方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we address two related challenges in multimodal local search applications on mobile devices: first, correctly displaying the business names, and second, harvesting language model training data from an inconsistently labeled corpus. We investigate the impact of common text normalization and the quality of language model training corpus on the accuracy of displayed results. We propose a new language model framework that eliminates the need for explicit inverse text normalization. The same framework can be applied to sift through corrupted language model training data. Our new language model is 25% more accurate while 25% smaller in size.

机译：在本文中，我们在移动设备上为多模式本地搜索应用中解决了两个相关挑战：首先，正确显示业务名称，第二个，从不一致标记的语料库中收集语言模型培训数据。我们调查常见文本规范化的影响和语言模型培训语料库的影响，以表现出现的准确性。我们提出了一种新的语言模型框架，消除了对明确的逆文本归一代的需要。可以应用于通过损坏的语言模型培训数据筛选相同的框架。我们的新语言模型更准确，而25％的大小较小。

著录项

来源
《International Speech Communication Association》|2008年||共4页
会议地点
作者
Yun-Cheng Ju; Julian Odell;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN912.3-532;
关键词
text normalization; inverse text normalization; language model; multimodal; voice search; transduction; language resources;

机译：文本归一化;逆文本规范化;语言模型;多模式;语音搜索;转发;语言资源;

相似文献

外文文献
中文文献
专利

1. "Method and System for Converting Image Text Documents in Bit-Mapped Formats to Searchable Text and for Searching the Searchable Text" in Patent Application Approval Process [J] . Robotics and Machine Learning . 2013,第1期

机译：专利申请批准过程中的“将位图格式的图像文本文档转换为可搜索文本并搜索可搜索文本的方法和系统”
2. The voice, text, and the visual as semiotic companions: an analysis of the materiality and meaning potential of multimodal screen feedback [J] . Tyrer Clare Education and information technologies . 2021,第4期

机译：语音，文本和视觉作为符号生伴侣：多式联屏幕反馈的唯物性和意义潜力分析
3. A clustering approach using a combination of gravitational search algorithm and k-harmonic means and its application in text document clustering [J] . MINA MIRHOSSEINI Turkish Journal of Electrical Engineering and Computer Sciences . 2017,第2期

机译：结合重力搜索算法和k-调和手段的聚类方法及其在文本文档聚类中的应用
4. A Language-Modeling Approach to Inverse Text Normalization and DataCleanup for Multimodal Voice Search Applications [C] . Yun-Cheng Ju, Julian Odell International Speech Communication Association . 2008

机译：用于多模式语音搜索应用的逆文本归一化和DataCleanup的语言建模方法
5. "Please, Tell Them!": Voices from College Classrooms on Effects of Resources of Multimodal Ensembles on Polylingual EAL Speaking College Students' Meaning Making of Conventional Print-Based Texts [D] . Gould, Olga. 2017

机译：“请告诉他们！”：大学教室里关于多模式合唱团资源对多语言EAL口语的影响，大学生对传统印刷文本的理解
6. Research and applications: MedXN: an open source medication extraction and normalization tool for clinical text [O] . Sunghwan Sohn, Cheryl Clark, Scott R Halgrim, 2014

机译：研究与应用：MedXN：用于临床试验的开源药物提取和标准化工具
7. Learning Multimodality through Genre-Based Multimodal Texts Analysis: Listening to Students’ Voices [O] . Fuad Abdullah, Soni Tantan Tandiana, Yuyus Saputra 2020

机译：通过基于类型的多媒体文本分析学习多语言：听取学生的声音
8. PDE-Constrained Optimization Approach to Uncertainty in Inverse Problems with Applications to Inverse Scattering. [R] . Biros, G., Ghattas, O. 2010

机译：逆问题不确定性的偏微分约束优化方法及其在逆散射中的应用。

A Language-Modeling Approach to Inverse Text Normalization and DataCleanup for Multimodal Voice Search Applications

摘要

著录项

相似文献

相关主题

期刊订阅