Extraction of Bilingual Technical Terms for Chinese-Japanese Patent Translation

机译：中日专利翻译双语技术术语摘录

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The translation of patents or scientific papers is a key issue that should be helped by the use of statistical machine translation (SMT). In this paper, we propose a method to improve Chinese-Japanese patent SMT by pre-marking the training corpus with aligned bilingual multi-word terms. We automatically extract multi-word terms from monolingual corpora by combining statistical and linguistic filtering methods. We use the sampling-based alignment method to identify aligned terms and set some threshold on translation probabilities to select the most promising bilingual multi-word terms. We pre-mark a Chinese-Japanese training corpus with such selected aligned bilingual multi-word terms. We obtain the performance of over 70% precision in bilingual term extraction and a significant improvement of BLEU scores in our experiments on a Chinese-Japanese patent parallel corpus.

机译：专利或科学论文的翻译是一个关键问题，应通过使用统计机器翻译（SMT）加以帮助。在本文中，我们提出了一种通过用对齐的双语多词术语预先标记训练语料来改善中日专利SMT的方法。我们通过结合统计和语言过滤方法，从单语语料库中自动提取多词术语。我们使用基于采样的对齐方法来识别对齐的术语，并为翻译概率设置一些阈值，以选择最有希望的双语多词术语。我们使用这样选择的对齐的双语多词术语为中日培训语料库预先标记。在我们对中日专利平行语料库的实验中，我们在双语术语提取中获得了70％以上的精度，并显着提高了BLEU分数。

著录项

来源
《Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies》|2016年|81-87|共7页
会议地点
作者
Wei Yang; Jinghui Yan; Yves Lepage;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Identifying Bilingual Synonymous Technical Terms from Phrase Tables and Parallel Patent Sentences [J] . Bing Liang, Takehito Utsuro, Mikio Yamamoto Procedia - Social and Behavioral Sciences . 2011,第2期

机译：从短语表和平行专利句中识别双语同义技术术语
2. Japanese Term Extraction Toward French-Japanese Bilingual Term Extraction on Wind Power Generation Domain [J] . Teruo KOYAMA, Shouzaburo MINAMOTO, Koichi TAKEUCHI, 電子情報通信学会技術研究報告 . 2012,第367期

机译：面向风能发电领域法日双语术语的日语术语提取
3. Japanese Term Extraction Toward French-Japanese Bilingual Term Extraction on Wind Power Generation Domain [J] . Koichi TAKEUCHI, Shouzaburo MINAMOTO, Emmanuel PLANAS, 電子情報通信学会技術研究報告. 言語理解とコミュニケーション. Natural Language Understanding and Models of Communication . 2012,第367期

机译：面向风能发电领域的法日双语术语的日语术语提取
4. Extraction of Bilingual Technical Terms for Chinese-Japanese Patent Translation [C] . Wei Yang, Jinghui Yan, Yves Lepage Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies . 2016

机译：中文专利翻译的双语技术术语提取
5. The representation of multiple translations in bilingual memory: An examination of lexical organization for concrete, abstract, and emotion words in Spanish-English bilinguals. [D] . Basnight-Brown, Dana M. 2009

机译：双语记忆中多种翻译的表示：西班牙语-英语双语者中具体，抽象和情感词的词汇组织检查。
6. Patent Keyword Extraction Algorithm Based on Distributed Representation for Patent Classification [O] . Jie Hu, Shaobo Li, Yong Yao, 2018

机译：基于专利分类的分布式表示的专利关键词提取算法
7. Evaluating Features for Identifying Japanese-Chinese Bilingual Synonymous Technical Terms from Patent Families [O] . Zi Long, Takehito Utsuro, Tomoharu Mitsuhashi, 2015

机译：评估专利家庭识别日语 - 中文双语同义技术术语的功能

Extraction of Bilingual Technical Terms for Chinese-Japanese Patent Translation

摘要

著录项

相似文献

相关主题

期刊订阅