首页> 外文期刊>IEEE transactions on information technology in biomedicine >A Novel Method of Language Modeling for Automatic Captioning in TC Video Teleconferencing
【24h】

A Novel Method of Language Modeling for Automatic Captioning in TC Video Teleconferencing

机译:TC视频电话会议中自动字幕的语言建模新方法

获取原文
获取原文并翻译 | 示例
       

摘要

We are developing an automatic captioning system for teleconsultation video teleconferencing (TC-VTC) in telemedicine, based on large vocabulary conversational speech recognition. In TC-VTC, doctors'' speech contains a large number of infrequently used medical terms in spontaneous styles. Due to insufficiency of data, we adopted mixture language modeling, with models trained from several datasets of medical and nonmedical domains. This paper proposes novel modeling and estimation methods for the mixture language model (LM). Component LMs are trained from individual datasets, with class n-gram LMs trained from in-domain datasets and word n-gram LMs trained from out-of-domain datasets, and they are interpolated into a mixture LM. For class LMs, semantic categories are used for class definition on medical terms, names, and digits. The interpolation weights of a mixture LM are estimated by a greedy algorithm of forward weight adjustment (FWA). The proposed mixing of in-domain class LMs and out-of-domain word LMs, the semantic definitions of word classes, as well as the weight-estimation algorithm of FWA are effective on the TC-VTC task. As compared with using mixtures of word LMs with weights estimated by the conventional expectation-maximization algorithm, the proposed methods led to a 21% reduction of perplexity on test sets of five doctors, which translated into improvements of captioning accuracy
机译:我们正在开发基于大型词汇会话语音识别的远程医疗视频咨询会议(TC-VTC)的自动字幕系统。在TC-VTC中,医生的讲话包含大量不经常使用的自发医学术语。由于数据不足,我们采用了混合语言建模,并从医学和非医学领域的多个数据集中训练了模型。本文提出了一种新的混合语言模型(LM)建模和估计方法。组件LM是从各个数据集中训练的,而n-gram LM是从域内的数据集中训练的,单词n-gram LM是从域外的数据集中训练的,然后被插值到混合LM中。对于LM类,语义类别用于医学术语,名称和数字的类别定义。通过前向权重调整(FWA)的贪婪算法来估计混合物LM的内插权重。提出的域内类LM和域外单词LM的混合,单词类的语义定义以及FWA的权重估计算法对于TC-VTC任务是有效的。与使用由传统的期望最大化算法估计的权重的单词LM的混合相比,所提出的方法使五位医生的测试集的困惑度降低了21%,这转化为字幕准确性的提高

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号