【24h】

Finding Cross-Lingual Spelling Variants

机译:寻找交叉语言拼写变体

获取原文
获取外文期刊封面目录资料

摘要

Finding term translations as cross-lingual spelling variants on the fly is an important problem for cross-lingual information retrieval (CLIR). CLIR is typically approached by automatically translating a query into the target language. For an overview of cross-lingual information retrieval, see [1]. When automatically translating the query, specialized terminology is often missing from the translation dictionary. The analysis of query properties in [2] has shown that proper names and technical terms often are prime keys in queries, and if not properly translated or transliterated, query performance may deteriorate significantly. As proper names often need no translation, a trivial solution is to include the untranslated keys as such into the target language query. However, technical terms in European languages often have common Greek or Latin roots, which allows for a more advanced solution using approximate string matching to find the word or words most similar to the source keys in the index of the target language text database.
机译:在飞行中发现术语翻译是交叉语言拼写变体是交叉语言信息检索(CLIR)的重要问题。通常通过将查询转换为目标语言来接近CLIR。有关交叉信息检索的概述,请参阅[1]。当自动翻译查询时,翻译字典通常丢失专门术语。 [2]中查询属性的分析表明,正确的名称和技术术语通常是查询中的主要键,如果没有正确翻译或翻译,则查询性能可能会显着恶化。由于正确的名称通常不需要翻译,琐碎的解决方案是将未翻译的密钥包含在目标语言查询中。然而,欧洲语言的技术术语通常具有普通的希腊语或拉丁根,这允许使用近似字符串匹配的更高级解决方案,以查找与目标语言文本数据库索引中的源键最相似的单词或单词。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号