IEEE Workshop on Automatic Speech Recognition and Understanding

Refine bigram PLSA model by assigning latent topics unevenly



Abstract

As an important component of many speech and language processing applications, statistical language models have been widely investigated. The bigram topic model, which combines the advantages of the traditional n-gram model and the topic model, has proved to be a promising language modeling approach. However, the original bigram topic model assigns the same number of latent topics to every context word, ignoring the fact that the latent semantics of different context words vary in complexity. We present a new bigram topic model, the bigram PLSA model, and propose a modified training strategy that assigns latent topics to context words unevenly, according to an estimate of their latent semantic complexity. As a consequence, a refined bigram PLSA model is obtained. Experiments on HUB4 Mandarin test transcriptions show the superiority of the proposed model over existing models, and further perplexity improvements are achieved through the use of the refined bigram PLSA model.
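To make the idea concrete, the following Python sketch (not the authors' code; the vocabulary size, topic counts, and random parameters are illustrative assumptions) shows a bigram PLSA mixture in which each context word v carries its own number of latent topics K[v], together with the perplexity metric the abstract reports. For brevity the document index of full PLSA is folded into a per-context topic mixture.

```python
import numpy as np

rng = np.random.default_rng(0)

V = 1000  # vocabulary size (toy value, assumption)

# Uneven topic assignment: each context word v gets its own topic count K[v].
# Here the counts are random placeholders; the paper derives them from an
# estimate of each context word's latent semantic complexity.
K = rng.integers(2, 16, size=V)

# Per-context parameters of the mixture (randomly initialized for the sketch):
#   theta[v][z]  ~ P(z | v)     topic mixture for context word v (K[v] topics)
#   phi[v][z, w] ~ P(w | v, z)  next-word distribution for each topic
theta = [rng.dirichlet(np.ones(k)) for k in K]
phi = [rng.dirichlet(np.ones(V), size=k) for k in K]

def bigram_plsa_prob(w: int, v: int) -> float:
    """P(w | v) = sum_z P(w | v, z) * P(z | v), marginalizing over K[v] topics."""
    return float(theta[v] @ phi[v][:, w])

def perplexity(bigrams) -> float:
    """Perplexity over (context, word) pairs, the evaluation metric used above."""
    logp = [np.log(bigram_plsa_prob(w, v)) for v, w in bigrams]
    return float(np.exp(-np.mean(logp)))
```

In a real system theta and phi would be fitted with EM on training text rather than sampled; the point of the sketch is only that the summation bound K[v] differs per context word instead of being one global constant.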
