Proceedings of 2011 International Conference on Machine Learning and Cybernetics

A CLIR-oriented OOV translation mining method from bilingual webpages


Abstract

Translating unknown, out-of-vocabulary (OOV) terms is a major bottleneck for cross-language information retrieval (CLIR). This paper proposes a method that addresses three subproblems: detecting relevant webpages, extracting translations with correct boundaries, and ranking candidate translations. Topic word translations are used to expand the source query and collect bilingual search engine snippets. An improved Frequency Change Measurement method then extracts valid candidates from these noisy, small bilingual corpora. Finally, frequency-distance, surface-pattern, and phonetic features are used to select the correct translation. Experimental results show impressive performance for unknown term translation mining.
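
The abstract does not define its frequency-distance feature, so the following is only a minimal sketch of one plausible reading: a candidate scores higher the more often it co-occurs with the source term in a snippet, and the closer it appears to that term. The function names, the `1 / (1 + gap)` weighting, and the sample snippets are all assumptions made for illustration and are not taken from the paper.

```python
from collections import defaultdict


def frequency_distance_scores(term, candidates, snippets):
    """Sum 1 / (1 + gap) over every snippet where a candidate co-occurs with term.

    `gap` is the number of characters between the first occurrence of the
    term and the first occurrence of the candidate (0 if they overlap).
    This weighting is a hypothetical choice, not the paper's formula.
    """
    scores = defaultdict(float)
    for snippet in snippets:
        t_pos = snippet.find(term)
        if t_pos < 0:
            continue  # the source term is absent, so the snippet is not evidence
        t_end = t_pos + len(term)
        for cand in candidates:
            c_pos = snippet.find(cand)
            if c_pos < 0:
                continue
            c_end = c_pos + len(cand)
            if c_pos >= t_end:      # candidate follows the term
                gap = c_pos - t_end
            elif t_pos >= c_end:    # candidate precedes the term
                gap = t_pos - c_end
            else:                   # spans overlap
                gap = 0
            scores[cand] += 1.0 / (1 + gap)
    return dict(scores)


def rank_candidates(term, candidates, snippets):
    """Return the co-occurring candidates, best score first."""
    scores = frequency_distance_scores(term, candidates, snippets)
    return sorted(scores, key=scores.get, reverse=True)


if __name__ == "__main__":
    snippets = ["非典 (SARS)", "SARS 疫情 医院"]
    print(rank_candidates("SARS", ["非典", "医院"], snippets))
```

In a fuller pipeline this score would be only one signal; the abstract states that surface patterns and phonetic features are also used, and how the three are combined is not specified there.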
