An improved inverted index model and its retrieval algorithm

机译：改进的倒指数模型及其检索算法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The traditional inverted index scheme has some deficiencies owing to its only covering the word terms' frequency and positions in documents, but not covering the space sequences of the word terms in the documents' structures. This paper developed an improved inverted index scheme, which combined the paragraph sequences, sentence sequences and word sequences as a list to replace the posting list in the traditional inverted index. And the algorithm of similarity calculation and text retrieval based on this improved inverted index scheme was given. The similarity is the result of the traditional similarity multiplying paragraph sequence similarity coefficient, sentence sequence similarity coefficient, and words sequence similarity coefficient, which can denote as SimNew(D,Q) = Sim(D,Q)~* Ceof_P ~* Ceof_s~* Ceof_w. By calculating similarity, the documents can be ranked as retrieval results. As an experiment, some documents selected from the search results of Google was reranked by similarity calculated with this algorithm. The result of the experiment shows that this algorithm is helpful for users to retrieve information which can match the users' queries much more.

机译：由于其唯一涵盖文档中的单词术语频率和位置，但不覆盖文档结构中的单词术语的空间序列，传统的倒数指数方案具有一些不足之处。本文开发了一种改进的倒置指数方案，将段序列，句子序列和单词序列组合为列表以替换传统反相索引中的发布列表。给出了基于这种改进的倒置索引方案的相似性计算和文本检索算法。相似性是传统相似性乘法段序列相似度系数，句子序列相似系数的结果，以及单词序列相似度系数，可以表示为simnew（d，q）= sim（d，q）〜* ceof_p〜* ceof_s〜 * ceof_w。通过计算相似性，文档可以作为检索结果排名。作为实验，从Google的搜索结果中选择的一些文件被使用该算法计算的相似性重新命名。实验结果表明，该算法有助于用户检索可以匹配用户查询的信息。

著录项

来源
《International Conference on Mechatronics and Information Technology》|2008年||共2页
会议地点
作者
Chaotao Liu; Zushu Li;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP271.4;
关键词
inverted-index; similarity; information retrieval; document structure; similarity coefficient;

机译：倒指数;相似度;信息检索;文件结构;相似系数;

相似文献

外文文献
中文文献
专利

1. Towards improved snow water equivalent retrieval algorithms Towards improved snow water equivalent retrieval algorithms basins of western USA [J] . Naoki Mizukami12* and Sanja Perica12 Hydrological Processes . 2012,第13期

机译：寻求改进的雪水当量检索算法寻求改进的雪水当量检索算法
2. An Improved Soil Moisture Retrieval Algorithm Based on the Land Parameter Retrieval Model for Water–Land Mixed Pixels Using AMSR-E Data [J] . Song Peilin, Huang Jingfeng, Mansaray Laurin R., IEEE Transactions on Geoscience and Remote Sensing . 2019,第10期

机译：基于AMSR-E数据的水陆混合像素土地参数反演模型的改进土壤水分反演算法
3. A Close-to-linear Topic Detection Algorithm using Relative Entropy based Relevance Model and Inverted Indices Retrieval [J] . Steve Kansheng Shi, Lemin Li International journal of computational intelligence systems . 2012,第1a6期

机译：基于相对熵的相关模型和逆指标检索的近似线性主题检测算法
4. An improved inverted index model and its retrieval algorithm [C] . Chaotao Liu, Zushu Li International Conference on Mechatronics and Information Technology . 2008

机译：改进的倒指数模型及其检索算法
5. Algorithms for inverting Hodgkin-Huxley type neuron models . [D] . Shepardson, Dylan. 2009

机译：Hodgkin-Huxley型神经元模型的逆算法。
6. Retrieval Algorithms for Road Surface Modelling Using Laser-Based Mobile Mapping [O] . Anttoni Jaakkola, Juha Hyyppä, Hannu Hyyppä, 2008

机译：基于激光的移动映射的路面建模检索算法
7. Improving The Sea Surface Wind Retrieval Algorithm For NASA Scatterometers Using Atmospheric Boundary Layer Models [O] . Young-joon Kim, Kyung S. Pak, Carol S. Hsu, 1998

机译：利用大气边界层模型改进Nasa散射仪海面风场反演算法
8. Retrieval of Geophysical Parameters with Integrated Modeling of Land Surfaces and211 Atmosphere (Models/Inversion Algorithms) [R] . Hallikainen, M., Pulliainen, J., Maetzler, K., 1998

机译：利用地表和211大气的综合建模反演地球物理参数（模型/反演算法）

An improved inverted index model and its retrieval algorithm

摘要

著录项

相似文献

相关主题

期刊订阅