Linguistic-Relationships-Based Approach for Improving Word Alignment

PHUOC TRAN; DIEN DINH; TAN LE; LONG H. B. NGUYEN

首页> 外文期刊>ACM transactions on Asian language information processing >Linguistic-Relationships-Based Approach for Improving Word Alignment

【24h】

Linguistic-Relationships-Based Approach for Improving Word Alignment

机译：基于语言关系的改进单词对齐方式的方法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The unsupervised word alignments (such as GIZA++) are widely used in the phrase-based statistical machine translation. The quality of the model is proportional to the size and the quality of the bilingual corpus. However, for low-resource language pairs such as Chinese and Vietnamese, a result of unsupervised word alignment sometimes is of low quality due to the sparse data. In addition, this model does not take advantage of the linguistic relationships to improve performance of word alignment. Chinese and Vietnamese have the same language type and have close linguistic relationships. In this article, we integrate the characteristics of linguistic relationships into the word alignment model to enhance the quality of Chinese-Vietnamese word alignment. These linguistic relationships are Sino-Vietnamese and content word. The experimental results showed that our method improved the performance of word alignment as well as the quality of machine translation.

机译：无监督的单词对齐方式（例如GIZA ++）广泛用于基于短语的统计机器翻译中。模型的质量与双语语料库的大小和质量成正比。但是，对于资源稀少的语言对（例如中文和越南语），由于数据稀疏，无监督单词对齐的结果有时质量较低。另外，该模型没有利用语言关系来改善单词对齐的性能。中文和越南文具有相同的语言类型，并且在语言上有密切的关系。在本文中，我们将语言关系的特征整合到单词对齐模型中，以提高汉语-越南语单词对齐的质量。这些语言关系是中越和内容词。实验结果表明，我们的方法提高了单词对齐的性能以及机器翻译的质量。

著录项

来源
《ACM transactions on Asian language information processing》 |2018年第1期|5.1-5.16|共16页
作者
PHUOC TRAN; DIEN DINH; TAN LE; LONG H. B. NGUYEN;
展开▼
作者单位

Ton Duc Thang University;

VNU University of Science;

Universite Du Quebec A Montreal;

VNU University of Science;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Word alignment; linguistic relationships; Chinese-Vietnamese machine translation; Sino-Vietnamese; content word;

机译：单词对齐;语言关系;汉越机器翻译;中越;内容词;
入库时间 2022-08-18 04:03:41

相似文献

外文文献
中文文献
专利

1. ReformAlign: improved multiple sequence alignments using a profile-based meta-alignment approach [J] . Dimitrios P Lyras, Dirk Metzler BMC Bioinformatics . 2014,第1期

机译：RegiceAlign：使用基于型材的元对准方法改进多个序列对齐
2. Improving neural sentence alignment with word translation [J] . Ying DING, Junhui LI, Zhengxian GONG, Frontiers of computer science . 2021,第1期

机译：用词翻译改善神经句子对齐
3. Bilingual lexical extraction based on word alignment for improving corpus search [J] . Andonovski Jelena, Sandrih Branislava, Kitanovic Olivera The Electronic Library . 2019,第4期

机译：基于词对齐的双语词汇提取改善语料库搜索
4. Improving Word Alignment of Rare Words with Word Embeddings [C] . Masoud Jalili Sabet, Heshaam Faili, Gholamreza Haffari International conference on computational linguistics . 2016

机译：通过词嵌入改善稀有词的词对齐
5. Improved word alignments for statistical machine translation. [D] . Fraser, Alexander. 2007

机译：改进了单词对齐，以进行统计机器翻译。
6. Sequence Comparison Alignment-Free Approach Based on Suffix Tree and L-Words Frequency [O] . Inês Soares, Ana Goios, António Amorim 2012

机译：基于后缀树和L词频率的序列比较无比对方法
7. Improving Word Alignment using Word Similarity [O] . Theerawat Songyot, David Chiang 2015

机译：使用单词相似度改善单词对齐

Linguistic-Relationships-Based Approach for Improving Word Alignment

摘要

著录项

相似文献

相关主题

期刊订阅