Language Model Adaptation Based on PLSA of Topics and Speakers for Automatic Transcription of Panel Discussions

Yuya AKITA; Tatsuya KAWAHARA

首页> 外文期刊>IEICE Transactions on Information and Systems >Language Model Adaptation Based on PLSA of Topics and Speakers for Automatic Transcription of Panel Discussions

【24h】

Language Model Adaptation Based on PLSA of Topics and Speakers for Automatic Transcription of Panel Discussions

机译：基于主题和说话者的PLSA的语言模型自适应以自动翻译小组讨论

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Appropriate language modeling is one of the major issues for automatic transcription of spontaneous speech. We propose an adaptation method for statistical language models based on both topic and speaker characteristics. This approach is applied for automatic transcription of meetings and panel discussions, in which multiple participants speak on a given topic in their own speaking style. A baseline language model is a mixture of two models, which are trained with different corpora covering various topics and speakers, respectively. Then, probabilistic latent semantic analysis (PLSA) is performed on the same respective corpora and the initial ASR result to provide two sets of unigram probabilities conditioned on input speech, with regard to topics and speaker characteristics, respectively. Finally, the baseline model is adapted by scaling N-gram probabilities with these unigram probabilities. For speaker adaptation purpose, we make use of a portion of the Corpus of Spontaneous Japanese (CSJ) in which a large number of speakers gave talks for given topics. Experimental evaluation with real discussions showed that both topic and speaker adaptation reduced test-set perplexity, and in total, an average reduction rate of 8.5% was obtained. Furthermore, improvement on word accuracy was also achieved by the proposed adaptation method.

机译：适当的语言建模是自发语音自动转录的主要问题之一。我们提出了一种基于主题和说话者特征的统计语言模型的适应方法。这种方法适用于会议和小组讨论的自动转录，其中多个参与者以自己的讲话方式就给定主题发言。基准语言模型是两种模型的混合，分别使用涵盖不同主题和说话者的不同语料库进行训练。然后，对相同的相应语料库和初始ASR结果执行概率潜在语义分析（PLSA），以分别针对主题和说话者特征提供两组以输入语音为条件的字母组合概率。最终，通过用这些单字母组概率缩放N-gram概率来适应基线模型。为了使演讲者适应，我们利用了自发日语语料库（CSJ）的一部分，其中大量演讲者针对给定主题进行了演讲。通过实际讨论进行的实验评估表明，主题和说话者的适应能力均降低了测试集的困惑，总的来说，平均降低率为8.5％。此外，通过提出的自适应方法还实现了单词准确性的提高。

著录项

来源
《IEICE Transactions on Information and Systems》 |2005年第3期|p.439-445|共7页
作者
Yuya AKITA; Tatsuya KAWAHARA;
展开▼
作者单位

School of Informatics, Kyoto University, Kyoto-shi, 606-8501 Japan, and also with PRESTO, Japan Science and Technology Agency (JST);

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类无线电电子学、电信技术;
关键词
language model; topic adaptation; speaker adaptation; PLSA; automatic speech recognition;

机译：语言模型;主题适应;说话人适应;PLSA;自动语音识别;

相似文献

外文文献
中文文献
专利

1. Trigger-Based Language Model Adaptation for Automatic Transcription of Panel Discussions [J] . Carlos TRONCOSO, Tatsuya KAWAHARA IEICE Transactions on Information and Systems . 2006,第3期

机译：基于触发器的语言模型自适应，可自动转录小组讨论
2. Language Modeling Using PLSA-Based Topic HMM [J] . Atsushi SAKO, Tetsuya TAKIGUCHI, Yasuo ARIKI IEICE Transactions on Information and Systems . 2008,第3期

机译：使用基于PLSA的主题HMM进行语言建模
3. Speaker indexing based on speaker model selection and automatic speech recognition in discussions [J] . Masafumi Nishida, Yuya Akita, Tatsuya Kawahara 電子情報通信学会技術研究報告. 音声. Speech . 2002,第530期

机译：讨论中基于说话人模型选择和自动语音识别的说话人索引
4. Language Model Adaptation based on PLSA of Topics and Speakers [C] . Yuya Akita, Tatsuya Kawahara International Conference on Spoken Language Processing; 20041004-08; Jeju(KR) . 2004

机译：基于主题和说话者的PLSA的语言模型适应
5. Rapid Speaker Normalization and Adaptation with Applications to Automatic Evaluation of Children's Language Learning Skills. [D] . Wang, Shizhen. 2010

机译：快速的说话人归一化和适应，并应用于儿童语言学习技能的自动评估。
6. Translation adaptation and validation of two versions of the Chronic Liver Disease Questionnaire in Malaysian patients for speakers of both English and Malay languages: a cross-sectional study [O] . Shasha Khairullah, Sanjiv Mahadeva 2017

机译：横断面研究在马来西亚患者中翻译改编和验证了两种版本的马来西亚慢性肝病问卷适用于讲英语和马来语的人
7. PLSA BASED TOPIC MIXTURE LANGUAGE MODELING APPROACH [O] . Shuanhu Bai, Haizhou Li 2013

机译：基于PLSA的主题混合语言建模方法
8. Speaker Adaptation of Language Models for Automatic Dialog Act Segmentation of Meetings [R] . Kolar, J. , Liu, Y. , Shriberg, E. 2007

机译：会议自动对话行为分割的语言模型演讲者自适应

Language Model Adaptation Based on PLSA of Topics and Speakers for Automatic Transcription of Panel Discussions

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅