Effective and efficient document ranking without using a large lexicon

机译：有效而高效的文档排名，而无需使用大型词典

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Although a word-based method is commonly used in document retrieval, it cannot be directly applicable to languages that have no obvious word separator. Given a lexicon, itis possible to identify words in documents, but a large lexicon is troublesome to maintain and makes retrieval systems large and complicated. This paper proposes an effective and efficient ranking that does not use a large lexicon; words need not be identified during document registration because a character-based signature file is used for the access structure. A user request, during document retrieval, is statistically analyzed to generate an appropriate query, and the query is evaluated efficiently in a word-based manner using the character-based index. We also propose two optimizing techniques to accelerate retrieval.

机译：尽管基于单词的方法通常用于文档检索中，但是它不能直接应用于没有明显单词分隔符的语言。给定一个词典，可以识别文档中的单词，但是大型词典难以维护，并且使检索系统变得庞大而复杂。本文提出了一种不使用大型词典的有效而高效的排名方法。在文档注册期间无需识别单词，因为基于字符的签名文件用于访问结构。在文档检索期间，对用户请求进行统计分析以生成适当的查询，并使用基于字符的索引以基于单词的方式对查询进行有效评估。我们还提出了两种优化技术来加快检索速度。

著录项

来源
《Twenty-Second international conference on very large data bases(VLDB'96)》|1996年|p.192-202|共11页
会议地点 Mumbai(IN);Mumbai(IN)
作者
OGAWA Yasushi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Effective and Efficient Ranking and Re-Ranking Feature Selector for Healthcare Analytics [J] . S.Ilangovan, A. Vincent Antony Kumar Intelligent automation and soft computing . 2020,第2期

机译：用于医疗保健分析的有效和高效的排名和重新排名功能选择器
2. An effective approach for semantic-based clustering and topic-based ranking of web documents [J] . Rajendra Kumar Roul International Journal of Data Science and Analytics . 2018,第4期

机译：Web文档基于语义的聚类和基于主题的排名的有效方法
3. Finding and ranking compact connected trees for effective keyword proximity search in XML documents [J] . Jianhua Feng, Guoliang Li, Jianyong Wang, Information Systems . 2010,第2期

机译：查找和排序紧凑的连接树，以便在XML文档中进行有效的关键字邻近搜索
4. Effective and efficient document ranking without using a large lexicon [C] . OGAWA Yasushi International conference on very large data bases . 1996

机译：无需使用大型词典的有效和有效的文件排名
5. Effective and efficient binarization of degraded document images. [D] . Parker, Jon Ivan. 2016

机译：对退化的文档图像进行有效和高效的二值化。
6. Medical document anonymization with a semantic lexicon. [O] . P. Ruch, R. H. Baud, A. M. Rassinoux, 2000

机译：使用语义词典对医疗文档进行匿名化。
7. An Effective XML Identifying Feedback Documents Method based on Two-Stage Ranking Model for Pseudo-Relevance Feedback [O] . Zhong Minjuan 2016

机译：一种基于伪相关反馈两阶段排序模型的有效XmL识别反馈文档方法

Effective and efficient document ranking without using a large lexicon

摘要

著录项

相似文献

相关主题

期刊订阅