【24h】

Phoneme-based Transliteration of Foreign Names for OOV Problem

机译:基于音素的OOV问题外文名称音译

获取原文
获取原文并翻译 | 示例

摘要

One problem seriously affecting CLIR performance is the processing of queries with embedded foreign names. A proper noun dictionary is never complete rendering name translation from English to Chinese ineffective. One way to solve this problem is not to rely on a dictionary alone but to adopt automatic translation according to pronunciation similarities, i.e. to map phonemes comprising an English name to sound units (e.g. pinyin) of the corresponding Chinese name. This process is called transliteration. We present a statistical transliteration method for CLIR applications. An efficient algorithm for phoneme alignment is described. Unlike traditional rule-based approaches, our method is data-driven. So it is independent of dialect features in Chinese. In addition, it is different from other statistical approaches based on source-channel framework in that we adopt a direct transliteration model, i.e. the direction of probabilistic estimation is consistent with transliteration direction. We demonstrate comparable performance on accuracy to other systems.
机译:严重影响CLIR性能的问题之一是带有嵌入式外部名称的查询的处理。专有名词词典永远不会完整地将名称从英语翻译为中文无效。解决该问题的一种方法不是仅依赖词典,而是根据发音相似度来采用自动翻译,即将包括英语名称的音素映射到对应的中文名称的声音单位(例如拼音)。此过程称为音译。我们提出一种用于CLIR应用程序的统计音译方法。描述了一种有效的音素对齐算法。与传统的基于规则的方法不同,我们的方法是数据驱动的。因此,它独立于中文的方言特征。另外,它与其他基于源通道框架的统计方法的不同之处在于,我们采用直接音译模型,即概率估计的方向与音译方向一致。我们展示出了与其他系统相比可比的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号