首页> 外文会议>International Conference on Intelligent Data Analysis >A Search Engine for Morphologically Complex Languages
【24h】

A Search Engine for Morphologically Complex Languages

机译:用于形态复杂语言的搜索引擎

获取原文
获取外文期刊封面目录资料

摘要

Document retrieval on natural languages with a rich morphology -- particularly in terms of derivation and (single-word) composition -- suffers from serious performance degradation with the direct query-term-to-text-word matching paradigm that underlies the vast majority of current search engines. We propose an alternative approach in which morphologically complex word forms, which appear in the query as well as in the documents, are segmented into relevant subwords (such as stems, named entities, acronyms) and are subsequently submitted to the matching procedure. We evaluate our approach with the AltaVista{sup}TM Search Engine on a large medical document collection.
机译:文档检索与丰富的形态学 - 特别是在推导和(单词)组成方面 - 遭受严重的性能下降与直接查询 - 术语到文本词匹配范例,这是绝大多数的基础当前的搜索引擎。我们提出一种替代方法,其中在查询以及文档中出现的形态复杂的单词形式被分段为相关子字(例如茎,命名实体,首字母缩略词),随后将提交给匹配过程。我们在大型医疗文件集合上评估我们的altavista {sup} tm搜索引擎的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号