FASTSUBS: An Efficient and Exact Procedure for Finding the Most Likely Lexical Substitutes Based on an N-Gram Language Model

Yuret D.

首页> 外文期刊>Signal Processing Letters, IEEE >FASTSUBS: An Efficient and Exact Procedure for Finding the Most Likely Lexical Substitutes Based on an N-Gram Language Model

【24h】

FASTSUBS: An Efficient and Exact Procedure for Finding the Most Likely Lexical Substitutes Based on an N-Gram Language Model

机译：FASTSUBS：一种基于N-Gram语言模型查找最可能的词性替换的高效且精确的过程

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Lexical substitutes have found use in areas such as paraphrasing, text simplification, machine translation, word sense disambiguation, and part of speech induction. However the computational complexity of accurately identifying the most likely substitutes for a word has made large scale experiments difficult. In this letter we introduce a new search algorithm, fastsubs, that is guaranteed to find the $K$ most likely lexical substitutes for a given word in a sentence based on an n-gram language model. The computation is sublinear in both $K$ and the vocabulary size $V$ . An implementation of the algorithm and a dataset with the top 100 substitutes of each token in the WSJ section of the Penn Treebank are available at http://goo.gl/jzKH0.

机译：词汇替代词已在释义，简化文本，机器翻译，词义歧义消除和语音诱导等领域得到了使用。但是，准确识别单词的最可能替代词的计算复杂性使大规模实验变得困难。在这封信中，我们介绍了一种新的搜索算法fastsubs，该算法可确保在基于n元语法模型的句子中找到给定单词的$ K $最有可能的词汇替换。 $ K $和词汇量$ V $都是次线性的。可在http://goo.gl/jzKH0获得该算法的实现以及Penn Treebank的WSJ部分中每个令牌的前100个替换项的数据集。

著录项

来源
《Signal Processing Letters, IEEE》 |2012年第11期|p.725-728|共4页
作者
Yuret D.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Lexical substitutes; statistical language modeling;

机译：词汇替代;统计语言建模;

相似文献

外文文献
中文文献
专利

1. Converting Continuous-Space Language Models into N-gram Language Models with Efficient Bilingual Pruning for Statistical Machine Translation [J] . RUI WANG, MASAO UTIYAMA, ISAO GOTO, ACM transactions on Asian language information processing . 2016,第3期

机译：通过高效的双语修剪将连续空间语言模型转换为N-gram语言模型以进行统计机器翻译
2. An empirical study of statistical language models: n-gram language models vs. neural network language models [J] . Freha Mezzoudj, Abdelkader Benyettou International Journal of Innovative Computing and Applications . 2018,第4期

机译：统计语言模型的实证研究：n-gram语言模型与神经网络语言模型
3. Lexical training through modeling and elicitation procedures with late talkers who have specific language impairment and developmental delays. [J] . Kouri TA Journal of speech, language, and hearing research: JSLHR . 2005,第1期

机译：通过建模和启发程序对有特定语言障碍和发育迟缓的后期说话者进行词汇训练。
4. A Comparative Study of Likelihood Ratio Based Forensic Text Comparison Procedures: Multivariate Kernel Density with Lexical Features vs. Word N-grams vs. Character N-grams [C] . Ishihara Shunichi Cybercrime and Trustworthy Computing Workshop . 2015

机译：基于似然比的法医文本比较程序的比较研究：具有词法特征的多变量内核密度与单词N-grams与字符N-grams
5. Language-independent text learning with statistical n-gram language models. [D] . Peng, Fuchun. 2003

机译：统计n-gram语言模型的独立于语言的文本学习。
6. Modeling Actions of PubMed Users with N-Gram Language Models [O] . Jimmy Lin, W. John Wilbur -1

机译：N-Gram语言模型对PubMed用户的建模动作
7. FASTSUBS: an efficient and exact procedure for finding the most likely lexical substitutes based on an N-gram language model [O] . Yüret, Deniz 2015

机译：FASTSUBS：一种基于N-gram语言模型查找最可能的词汇替代词的有效且准确的过程
8. Investigation of Back-off Based Interpolation Between Recurrent Neural Network and N-gram Language Models (Author's Manuscript). [R] . Chen, X., Liu, X., Gales, M. J. F., 2016

机译：基于回退的递归神经网络与N-gram语言模型的插值研究（作者手稿）。

FASTSUBS: An Efficient and Exact Procedure for Finding the Most Likely Lexical Substitutes Based on an N-Gram Language Model

摘要

著录项

相似文献

相关主题

期刊订阅