A Scalable Distributed Syntactic, Semantic, and Lexical Language Model

Ming Ta; Wenli Zho; Lei Zhen; Shaojun Wan

首页> 外文期刊>Computational linguistics >A Scalable Distributed Syntactic, Semantic, and Lexical Language Model

【24h】

A Scalable Distributed Syntactic, Semantic, and Lexical Language Model

机译：可扩展的分布式句法，语义和词汇语言模型

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents an attempt at building a large scale distributed composite language model that is formed by seamlessly integrating an n-gram model, a structured language model, and probabilistic latent semantic analysis under a directed Markov random field paradigm to simultaneously account for local word lexical information, mid-range sentence syntactic structure, and long-span document semantic content. The composite language model has been trained by performing a convergent N-best list approximate EM algorithm and a follow-up EM algorithm to improve word prediction power on corpora with up to a billion tokens and stored on a supercomputer. The large scale distributed composite language model gives drastic perplexity reduction over n-grams and achieves significantly better translation quality measured by the Bleu score and “readability” of translations when applied to the task of re-ranking the N-best list from a state-of-the-art parsing-based machine translation system.

机译：本文提出了构建大型分布式复合语言模型的尝试，该模型是通过在有向马尔可夫随机字段范式下无缝集成n元语法模型，结构化语言模型和概率潜在语义分析以同时解释本地词词法而形成的信息，中档句法结构和大跨度文档语义内容。通过执行收敛的N最佳列表近似EM算法和后续的EM算法来训练复合语言模型，以提高具有多达十亿个令牌的语料库上的单词预测能力，并将其存储在超级计算机上。大规模的分布式复合语言模型可将N-gram的混乱程度大大降低，并通过将Bleu得分和翻译的“可读性”衡量出来的翻译质量（从州/州/直辖市/直辖市/直辖市/直辖市/直辖市/直辖市/直辖市/直辖市/直辖市/直辖市/直辖市/直辖市/直辖市/直辖市/直辖市/直辖市/直辖市/直辖市）办公室开始重新排序，可以显着提高翻译质量。最先进的基于解析的机器翻译系统。

著录项

来源
《Computational linguistics》 |2012年第3期|共41页
作者
Ming Ta; Wenli Zho; Lei Zhen; Shaojun Wan;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
入库时间 2022-08-18 06:58:26

相似文献

外文文献
中文文献
专利

1. EXPLOITING SYNTACTIC, SEMANTIC, AND LEXICAL REGULARITIES IN LANGUAGE MODELING VIA DIRECTED MARKOV RANDOM FIELDS [J] . Shaojun Wang, Shaomin Wang, Li Cheng, Computational Intelligence . 2013,第4期

机译：通过直接的马尔可夫随机域探索语言建模中的句法，语义和词法规则
2. Neural networks involved in learning lexical-semantic and syntactic information in a second language [J] . Jutta L. Mueller, Shirley-Ann Rueschemeyer, Kentaro Ono, Frontiers in Psychology . 2014,第4期

机译：用于学习第二语言的词汇语义和句法信息的神经网络
3. Language and ToM development in autism versus Asperger syndrome: Contrasting influences of syntactic versus lexical/semantic maturity [J] . Jessica Paynter, Candida Peterson Research in autism spectrum disorders . 2010,第3期

机译：自闭症与阿斯伯格综合症的语言和ToM发展：句法与词汇/语义成熟度的对比影响
4. A Large Scale Distributed Syntactic, Semantic and Lexical Language Model for Machine Translation [C] . Ming Tan, Wenli Zhou, Lei Zheng, Annual meeting of the Association for Computational Linguistics;ACL 2011 . 2012

机译：机器翻译的大规模分布式句法，语义和词汇语言模型
5. Lexical processing in sentence context: Semantic and syntactic factors. [D] . Fancher, Eileen. 2014

机译：句子上下文中的词汇处理：语义和句法因素。
6. Neural networks involved in learning lexical-semantic and syntactic information in a second language [O] . Jutta L. Mueller, Shirley-Ann Rueschemeyer, Kentaro Ono, 2014

机译：神经网络以第二语言学习词汇语义和句法信息
7. A Scalable Distributed Syntactic, Semantic, and Lexical Language Model [O] . Ming Tan, Wenli Zhou, Lei Zheng, 2013

机译：可扩展的分布式句法，语义和词汇语言模型
8. Scalable Distributed Syntactic, Semantic, and Lexical Language Model. [R] . Tan, M., Zhou, W., Zheng, L., 2012

机译：可扩展的分布式句法，语义和词汇语言模型。

A Scalable Distributed Syntactic, Semantic, and Lexical Language Model

摘要

著录项

相似文献

相关主题

期刊订阅