【24h】

English to Persian Transliteration

机译:英语到波斯语音译

获取原文
获取原文并翻译 | 示例

摘要

Persian is an Indo-European language written using Arabic script, and is an official language of Iran, Afghanistan, and Tajikistan. Transliteration of Persian to English—that is, the character-by-character mapping of a Persian word that is not readily available in a bilingual dictionary—is an unstudied problem. In this paper we make three novel contributions. First, we present performance comparisons of existing grapheme-based transliteration methods on English to Persian. Second, we discuss the difficulties in establishing a corpus for studying transliteration. Finally, we introduce a new model of Persian that takes into account the habit of shortening, or even omitting, runs of English vowels. This trait makes transliteration of Persian particularly difficult for phonetic based methods. This new model outperforms the existing grapheme based methods on Persian, exhibiting a 24% relative increase in transliteration accuracy measured using the top-5 criteria.
机译:波斯语是使用阿拉伯文字书写的印欧语系语言,是伊朗,阿富汗和塔吉克斯坦的官方语言。波斯语到英语的音译(即,双语词典中不易找到的波斯语单词的逐字符映射)是一个尚未研究的问题。在本文中,我们做出了三点新颖的贡献。首先,我们将现有的基于字形的音译方法在英语到波斯语上的性能进行比较。其次,我们讨论了建立研究音译语料库的困难。最后,我们介绍了一种新的波斯语模型,其中考虑了缩短甚至省略英语元音的习惯。对于基于语音的方法,此特征使波斯语的音译特别困难。这个新模型的表现优于波斯语上基于现有字素的方法,使用前5个标准测得的音译准确度相对提高了24%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号