Annotated Corpora for Word Alignment Between Japanese and English and its Evaluation with MAP-based Word Aligner

Abstract

This paper presents two annotated corpora for word alignment between Japanese and English. The annotations are built on top of the IWSLT-2006 and NTCIR-8 corpora. The IWSLT-2006 corpus covers the travel conversation domain, while the NTCIR-8 corpus covers the patent domain. We annotated the first 500 sentence pairs from the IWSLT-2006 corpus and the first 100 sentence pairs from the NTCIR-8 corpus. After describing the annotation guidelines, we present two evaluation algorithms that use such hand-annotated corpora: one is well known to word alignment researchers, while the other is novel and is intended to evaluate the MAP-based word aligner of Okita et al. (2010b).

Bibliographic details

Source
International Conference on Language Resources and Evaluation | 2012 | 8 pages
Author
Tsuyoshi Okita
Original format: PDF
Chinese Library Classification: Computer software
Keywords
Annotated Corpus for Word Alignment; Statistical Machine Translation; Evaluation

