首页> 外文会议>International Conference on Computational Linguistics >Splitting Input Sentence for Machine Translation Using Language Model with Sentence Similarity

【24h】

Splitting Input Sentence for Machine Translation Using Language Model with Sentence Similarity

机译：使用语言模型与句子相似性的机器翻译拆分输入句子

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In order to boost the translation quality of corpus-based MT systems for speech translation, the technique of splitting an input sentence appears promising. In previous research, many methods used N-gram clues to split sentences. In this paper, to supplement N-gram based splitting methods, we introduce another clue using sentence similarity based on edit-distance. In our splitting method, we generate candidates for sentence splitting based on N-grams, and select the best one by measuring sentence similarity. We conducted experiments using two EBMT systems, one of which uses a phrase and the other of which uses a sentence as a translation unit. The translation results on various conditions were evaluated by objective measures and a subjective measure. The experimental results show that the proposed method is valuable for both systems.

机译：为了提高基于语料库的MT系统的翻译质量，用于语音翻译，拆分输入句的技术似乎有前景。在以前的研究中，许多方法使用N-Gram线索来分裂句子。在本文中，为了补充基于n-gram的分裂方法，我们使用基于编辑距离的句子相似性介绍另一个线索。在我们的拆分方法中，我们基于N-GRAM生成句子分裂的候选者，并通过测量句子相似度选择最佳选择。我们使用两个EBMT系统进行实验，其中一个是使用短语，另一个使用句子作为翻译单元。通过客观措施和主观措施评估各种条件的翻译结果。实验结果表明，该方法对两个系统都是有价值的。

著录项

来源
《International Conference on Computational Linguistics 》|2004年||共7页
会议地点
作者
Takao Doi; Eiichiro Sumita;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序语言、算法语言 ;
关键词

相似文献

外文文献
中文文献
专利

1. Splitting Input for Machine Translation Using N-gram Language Model Together with Utterance Similarity [J] . Takao DOI, Eiichiro SUMITA IEICE Transactions on Information and Systems . 2005 ,第6期

机译：使用N-gram语言模型和话语相似性为机器翻译拆分输入
2. PROPOSED IF_THEN GRAMMAR TO TRANSLATE ENGLISH LANGUAGE SENTENCE TO AMERICAN SIGN LANGUAGE SENTENCE [J] . ASMAA M. HAMANDI, ALIAA K. ABDULHASSAN, HALA BAHJAT Journal of Theoretical and Applied Information Technology . 2016 ,第1期

机译：提议如果IF_THEN语法将英语语言句子转换为美国手语语言句子
3. Exploiting Parallel Sentences and Cosine Similarity for Identifying Target Language Translation [J] . Vijay Kumar Sharma, Namita Mittal Procedia Computer Science . 2016 ,第1期

机译：利用平行句子和余弦相似度识别目标语言翻译
4. Splitting Input Sentence for Machine Translation Using Language Model with Sentence Similarity [C] . Takao Doi, Eiichiro Sumita International Conference on Computational Linguistics . 2004

机译：使用语言模型与句子相似性的机器翻译拆分输入句子
5. Hybrid System Combination for Machine Translation: An Integration of Phrase-level and Sentence-level Combination Approaches. [D] . Ma, Wei-Yun. 2014

机译：机器翻译的混合系统组合：短语级和句子级组合方法的集成。
6. A disadvantage in bilingual sentence production modulated by syntactic frequency and similarity across languages [O] . Elin Runnqvist, Tamar H. Gollan, Albert Costa, -1

机译：句法频率和跨语言相似性对双语句子产生的不利影响
7. Splitting Input Sentence for Machine Translation Using Language Model with Sentence Similarity [O] . Takao Doi, Eiichiro Sumita 2013

机译：用句子相似度语言模型分割机器翻译输入句

Splitting Input Sentence for Machine Translation Using Language Model with Sentence Similarity

摘要

著录项

相似文献

相关主题

期刊订阅