首页> 外文会议>Workshop of the Cross-Language Evaluation Forum >Dictionary-Based Thai CLIR: An Experimental Survey of Thai CLIR
【24h】

Dictionary-Based Thai CLIR: An Experimental Survey of Thai CLIR

机译:清除泰国词典:泰国清晰的实验调查

获取原文

摘要

This paper describes our participation in the Cross-Language Evaluation Forum. Our objectives for this experiment were three-fold. Firstly, the coverage of the Thai bilingual dictionary was evaluated when translating queries. Secondly, we investigated whether the segmentation process affected the CLIR. Lastly, this research examines query formation techniques. Since this is our first international experimental in CLIR, our approach used dictionary-based techniques to translate Thai queries into English queries. Four runs were submitted to CLEF: (a) single mapping translation with manual segmentation, (b) multiple mapping translation with manual segmentation, (c) single mapping translation with automatic segmentation and (d) single mapping with query enhancement using words from our Thai thesaurus. The retrieval effectiveness was worse than we expected. The simple dictionary mapping technique is unable to achieve retrieval effectiveness, although the dictionary lookup gave a very high percentage of mapping words. The words from the dictionary lookup are not specific terms but each is mapped to a definition or meaning of that term. Furthermore, Thai stopwords, stemmed words and word separation have reduced the effectiveness of Thai CLIR.
机译:本文介绍了我们参与跨语言评估论坛。我们对此实验的目标是三倍。首先,在翻译查询时评估泰式双语词典的覆盖范围。其次,我们调查了分割过程是否影响了CLIR。最后,这项研究检查了查询形成技术。由于这是我们在CLIR的第一个国际实验中,我们的方法使用了基于字典的技术来将泰语查询翻译成英文查询。四个运行提交到谱号:(a)单映射转换与手动分段,(b)与手动分段的多个映射转换,(c)单映射转换与自动分割和(d)使用泰语中的单词使用查询增强的单个映射转换词库。检索效果比我们预期的更糟糕。简单的字典映射技术无法实现检索效果,尽管字典查找产生了非常高的映射词。来自字典查找的单词不是特定的术语,但每个字段都映射到该术语的定义或含义。此外,泰语止动件,茎和字分离减少了泰国CLIR的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号