We address for the first time unsupervised training for a translation task with hundreds of thousands of vocabulary words. We scale up the expectation-maximization (EM) algorithm to learn a large translation table without any parallel text or seed lexicon. First, we solve the memory bottleneck and enforce sparsity with a simple thresholding scheme for the lexicon. Second, we initialize the lexicon training with word classes, which efficiently boosts performance. Our methods produced promising results on two large-scale unsupervised translation tasks.
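To make the thresholding idea concrete, here is a minimal sketch, not the paper's implementation: after each EM iteration, lexicon entries whose probability falls below a cutoff are dropped and the surviving entries are renormalized, which keeps the table sparse and memory-bounded. The function names, the fallback of keeping the single best entry per row, and the threshold value are illustrative assumptions.

```python
from collections import defaultdict

def normalize(counts):
    """Turn fractional counts from the E-step into probabilities p(f | e)."""
    lexicon = defaultdict(dict)
    for e, row in counts.items():
        total = sum(row.values())
        for f, c in row.items():
            lexicon[e][f] = c / total
    return lexicon

def prune_lexicon(lexicon, threshold=1e-3):
    """Drop entries with p(f | e) below the threshold, then renormalize.

    Rows that would become empty keep their single best entry, so every
    source word retains at least one translation candidate.
    """
    pruned = {}
    for e, row in lexicon.items():
        kept = {f: p for f, p in row.items() if p >= threshold}
        if not kept:
            f_best = max(row, key=row.get)
            kept = {f_best: row[f_best]}
        total = sum(kept.values())
        pruned[e] = {f: p / total for f, p in kept.items()}
    return pruned

# Toy usage: two hypothetical source words with fractional counts.
counts = {
    "haus": {"house": 9.2, "home": 0.7, "the": 0.01},
    "katze": {"cat": 5.0, "dog": 0.002},
}
lexicon = prune_lexicon(normalize(counts), threshold=0.01)
print(lexicon)  # low-probability entries ("the", "dog") are pruned away
```

Pruning after each iteration, rather than once at the end, is what bounds memory: the full cross-product of a vocabulary with hundreds of thousands of words would never fit, so the table only ever stores entries that survive the cutoff.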