A Search Engine for Morphologically Complex Languages

机译：用于形态复杂语言的搜索引擎

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Document retrieval on natural languages with a rich morphology -- particularly in terms of derivation and (single-word) composition -- suffers from serious performance degradation with the direct query-term-to-text-word matching paradigm that underlies the vast majority of current search engines. We propose an alternative approach in which morphologically complex word forms, which appear in the query as well as in the documents, are segmented into relevant subwords (such as stems, named entities, acronyms) and are subsequently submitted to the matching procedure. We evaluate our approach with the AltaVista{sup}TM Search Engine on a large medical document collection.

机译：文档检索与丰富的形态学 - 特别是在推导和（单词）组成方面 - 遭受严重的性能下降与直接查询 - 术语到文本词匹配范例，这是绝大多数的基础当前的搜索引擎。我们提出一种替代方法，其中在查询以及文档中出现的形态复杂的单词形式被分段为相关子字（例如茎，命名实体，首字母缩略词），随后将提交给匹配过程。我们在大型医疗文件集合上评估我们的altavista {sup} tm搜索引擎的方法。

著录项

来源
《International Conference on Intelligent Data Analysis》|2001年||共11页
会议地点
作者
Udo Hahn; Martin Honeck; Stefan Schulz;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP3-53;
关键词

相似文献

外文文献
中文文献
专利

1. Do natural language search engines really understand what users want? A comparative study on three natural language search engines and Google [J] . Nadjla Hariri Online Information Review . 2013,第2期

机译：自然语言搜索引擎真的了解用户的需求吗？三种自然语言搜索引擎与Google的比较研究
2. Hindi Language Morphology and Search Engines Performance [J] . S.K. Dwivedi, Parul Rastogi, Vipin Awasthi International journal of computational intelligence research . 2010,第1期

机译：印地语语言形态和搜索引擎性能
3. Problems with the use of web search engines to find results in foreign languages [J] . Dirk Lewandowski Online Information Review . 2008,第5期

机译：使用网络搜索引擎查找外语结果时遇到的问题
4. A Search Engine for Morphologically Complex Languages [C] . Udo Hahn, Martin Honeck, Stefan Schulz International Conference on Intelligent Data Analysis . 2001

机译：用于形态复杂语言的搜索引擎
5. Improving Search via Named Entity Recognition in Morphologically Rich Languages: A Case Study in Urdu [D] . Riaz, Kashif H. 2018

机译：通过形态丰富的语言中的命名实体识别来改善搜索：以乌尔都语为例
6. MEGGASENSE – The Metagenome/Genome Annotated Sequence Natural Language Search Engine: A Platform for the Construction of Sequence Data Warehouses [O] . Ranko Gacesa, Jurica Zucko, Solveig K. Petursdottir, 2017

机译：MEGGASENSE –元基因组/基因组注释序列自然语言搜索引擎：构建序列数据仓库的平台
7. Problems with the use of Web search engines to find results in foreign languages [O] . Dirk Lewandowski 2013

机译：使用Web搜索引擎查找外语结果的问题

A Search Engine for Morphologically Complex Languages

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅