Extraction of transliteration pairs from parallel corpora using a statistical transliteration model

Lee CJ; Chang JS; Jang JSR

首页> 外文期刊>Information Sciences: An International Journal >Extraction of transliteration pairs from parallel corpora using a statistical transliteration model

【24h】

Extraction of transliteration pairs from parallel corpora using a statistical transliteration model

机译：使用统计音译模型从平行语料库中提取音译对

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper describes a framework for modeling the machine transliteration problem. The parameters of the proposed model are automatically acquired through statistical learning from a bilingual proper name list. Unlike previous approaches, the model does not involve the use of either a pronunciation dictionary for converting source words into phonetic symbols or manually assigned phonetic similarity scores between Source and target words. We also report how the model is applied to extract proper names and corresponding transliterations from parallel corpora. Experimental results show that the average rates of word and character precision are 93.8% and 97.8%, respectively. (c) 2004 Elsevier Inc. All rights reserved.

机译：本文介绍了用于对机器音译问题进行建模的框架。通过统计学习从双语专有名称列表中自动获取所提出模型的参数。与以前的方法不同，该模型不涉及使用发音词典将源单词转换为语音符号，也没有使用手动分配的源单词和目标单词之间的语音相似性得分。我们还将报告该模型如何应用于从并行语料库中提取专有名称和相应的音译。实验结果表明，平均单词率和字符精度分别为93.8％和97.8％。（c）2004 Elsevier Inc.保留所有权利。

著录项

来源
《Information Sciences: An International Journal》 |2006年第1期|共24页
作者
Lee CJ; Chang JS; Jang JSR;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词
transliteration pair; transliteration rnodel; parallel corpora; statistical learning; machine transliteration; ALGORITHM;

机译：音译对;音译德尔德尔;平行语料库;统计学习;机器音译;算法;

相似文献

外文文献
中文文献
专利

1. Extraction of transliteration pairs from parallel corpora using a statistical transliteration model [J] . Lee CJ, Chang JS, Jang JSR Information Sciences: An International Journal . 2006,第1期

机译：使用统计音译模型从平行语料库中提取音译对
2. A Phonetic Similarity Model for Automatic Extraction of Transliteration Pairs [J] . JIN-SHEA KUO, HAIZHOU LI, YING-KUEI YANG ACM transactions on Asian language information processing . 2007,第2期

机译：自动提取音译对的语音相似度模型
3. Transliteration Pair Extraction from Classical Chinese Buddhist Literature Using Phonetic Similarity Measurement [J] . Yu-Chun WANG, Chun-Kai WU, Richard Tzong-Han TSAI, New Generation Computing . 2013,第4期

机译：利用语音相似度测量从中国古典佛教文献中音译对提取
4. Extraction of Name and Transliteration in Monolingual and Parallel Corpora [C] . Tracy Lin, Jian-Cheng Wu, Jason S. Chang Conference of the Association for Machine Translation in the Americas(AMTA 2004); 20040928-1002; Washington,DC(US) . 2004

机译：单语和平行语料库的名称提取和音译
5. Parallel and Distributed Statistical-based Extraction of Relevant Multiwords from Large Corpora [D] . Gon?alves, Carlos Jorge de Sousa. 2017

机译：大型语料库中基于并行和分布式统计的相关多词提取
6. A method for solving scriptio continua in Javanese manuscript transliteration [O] . Anastasia Rita Widiarti, Reza Pulungan 2020

机译：解决手稿爪哇音译连续书写的方法
7. Extraction of transliteration pairs from parallel corpora using a statistical transliteration model [O] . Chun-Jen Lee 2012

机译：使用统计音译模型从平行语料库中提取音译对

Extraction of transliteration pairs from parallel corpora using a statistical transliteration model

摘要

著录项

相似文献

相关主题

期刊订阅