Building a Bilingual Dictionary from a Japanese-Chinese Patent Corpus

机译：从日本 - 中国专利语料库中建立双语词典

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we propose an automatic method to build a bilingual dictionary from a Japanese-Chinese parallel corpus. The proposed method uses character similarity between Japanese and Chinese, and a statistical machine translation (SMT) framework in a cascading manner. The first step extracts word translation pairs from the parallel corpus based on similarity between Japanese kanji characters (Chinese characters used in Japanese writing) and simplified Chinese characters. The second step trains phrase tables using 2 different SMT training tools, then extracts common word translation pairs. The third step trains an SMT system using the word translation pairs obtained by the first and the second steps. According to the experimental results, the proposed method yields 59.3% to 92.1% accuracy in the word translation pairs extracted, depending on the cascading step.

机译：在本文中，我们提出了一种自动方法来构建日语和日语并联语料库中的双语词典。该方法使用日语和中文之间的性格相似，以及级联方式的统计机器翻译（SMT）框架。第一步从并行语料库中提取单词转换对基于日语Kanji字符之间的相似性（日语写作中的汉字）和简体中文字符。第二步列车用2个不同的SMT训练工具短语表，然后提取公共词转换对。第三步使用由第一和第二步骤获得的字转换对进行SMT系统。根据实验结果，提出的方法在提取的转换对中提取的精度为59.3％至92.1％，取决于级联步骤。

著录项

来源
《Conference on Intelligent Text Processing and Computational Linguistics》|2013年||共9页
会议地点
作者
Keiji Yasuda; Eiichiro Sumita;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 006.3/5;
关键词
Dictionary; Japanese; Patent;

机译：字典;日语;专利;

相似文献

外文文献
中文文献
专利

1. Construction of a Japanese-Chinese Bilingual Dictionary Using English as an Intermediary [J] . YUJIE ZHANG, QING MA, HITOSHI ISAHARA International Journal of Computer Processing of Oriental Languages . 2005,第1期

机译：以英语为中介构建日汉双语词典
2. Turkish synonym identification from multiple resources: monolingual corpus, mono/bilingual online dictionaries, and WordNet [J] . TU?BA YILDIZ, BANU D?R?, SAVA? YILDIRIM Turkish Journal of Electrical Engineering and Computer Sciences . 2017,第2期

机译：来自多种资源的土耳其语同义词识别：单语语料库，单语/双语在线词典和WordNet
3. The Building Blocks of Child Bilingual Code-Mixing: A Cross-Corpus Traceback Approach [J] . Antje Endesfelder Quick, Stefan Hartmann Frontiers in Psychology . 2021,第a期

机译：儿童双语码混合的构建块：交叉语料库回溯方法
4. Building a Bilingual Dictionary from a Japanese-Chinese Patent Corpus [C] . Keiji Yasuda, Eiichiro Sumita International conference on intelligent text processing and computational linguistics . 2013

机译：用日汉专利语料库建立双语词典
5. Habitus of Deafhood: Compiling a corpus-based academic ASL dictionary using the sociolinguistic practices of Deaf individuals. [D] . Cobb, Gretchen Thom. 2017

机译：耳聋的习惯：使用聋人的社会语言习惯，编写基于语料库的学术性ASL词典。
6. The Building Blocks of Child Bilingual Code-Mixing: A Cross-Corpus Traceback Approach [O] . Antje Endesfelder Quick, Stefan Hartmann 2021

机译：儿童双语码混合的构建块：交叉语料库回溯方法
7. Combining Corpus and Machine-Readable Dictionary Data for Building Bilingual Lexicons [O] . Judith Klavans, Evelyne Tzoukermann 1996

机译：结合语料库和机器可读词典数据以构建双语词典

Building a Bilingual Dictionary from a Japanese-Chinese Patent Corpus

摘要

著录项

相似文献

相关主题

期刊订阅