首页> 外文期刊>Information Processing & Management >Learning regional transliteration variants
【24h】

Learning regional transliteration variants

机译:学习区域音译变体

获取原文
获取原文并翻译 | 示例
       

摘要

This paper conducts an inquiry into regional transliteration variants across Chinese speaking regions. We begin by studying the social association of regional transliterations, followed by postulating a computational model for effective transliteration extraction from the Web. In the computational model, we first propose constraint-based exploration by incorporating transliteration knowledge from transliteration modeling and predictive query suggestions from search engines into query formulation as constraints so as to increase the chance of desired transliteration returns in learning regional transliteration variants. Then, we study a cross-training algorithm, which explores the attainably helpful information of transliteration mappings across related regional corpora for the learning of transliteration models, to improve the overall extraction performance. The experimental results show that the proposed method not only effectively harvests a lexicon of regional transliteration variants but also mitigates the need of manual data labeling for transliteration modeling. We also carry out an investigation into the underlying characteristics of regional transliterations that motivate the cross-training algorithm.
机译:本文对汉语地区之间的音译变体进行了调查。我们首先研究区域音译的社会关联,然后提出一个用于从Web进行有效音译提取的计算模型。在计算模型中,我们首先通过将来自音译模型的音译知识和来自搜索引擎的预测性查询建议纳入约束条件的查询公式中,提出基于约束的探索,以增加学习区域音译变体所需的音译收益的机会。然后,我们研究一种交叉训练算法,该算法探索跨相关区域语料库的音译映射可获得的有用信息,以学习音译模型,从而提高整体提取性能。实验结果表明,所提出的方法不仅有效地收获了区域音译变种的词典,而且减轻了音译建模中手动数据标记的需要。我们还对激励跨训练算法的区域音译的潜在特征进行了调查。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号