An improved inverted index model and its retrieval algorithm

Abstract

The traditional inverted index records only the frequency and positions of terms in documents; it does not capture where those terms fall within a document's structure. This paper proposes an improved inverted index that replaces the traditional posting list with a list combining paragraph sequence, sentence sequence, and word sequence. An algorithm for similarity calculation and text retrieval based on this improved index is also given. The new similarity is the traditional similarity multiplied by the paragraph sequence, sentence sequence, and word sequence similarity coefficients, written as SimNew(D,Q) = Sim(D,Q) * Ceof_P * Ceof_s * Ceof_w. Documents are ranked by this similarity to produce the retrieval results. In an experiment, documents selected from Google search results were reranked by the similarity computed with this algorithm. The results indicate that the algorithm helps users retrieve information that matches their queries more closely.
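
The abstract does not say how the postings are laid out or how the three coefficients are computed, so the following is only a minimal Python sketch under stated assumptions. It assumes each posting is a (paragraph number, sentence number, word number) triple, that paragraphs are separated by blank lines, and that sentence numbers restart in each paragraph. The names `build_index`, `sim_new`, and `rerank` are hypothetical, and the coefficients (Ceof_P, Ceof_s, Ceof_w in the abstract) are taken as inputs rather than derived.

```python
import re
from collections import defaultdict


def build_index(docs):
    """Build term -> doc_id -> [(paragraph_no, sentence_no, word_no), ...].

    Assumption: paragraphs are split on blank lines, sentences on . ! ?
    The paper's actual segmentation rules are not given in the abstract.
    """
    index = defaultdict(lambda: defaultdict(list))
    for doc_id, text in docs.items():
        for p_no, para in enumerate(text.split("\n\n")):
            sentences = [s for s in re.split(r"[.!?]+", para) if s.strip()]
            for s_no, sent in enumerate(sentences):
                words = re.findall(r"\w+", sent.lower())
                for w_no, word in enumerate(words):
                    index[word][doc_id].append((p_no, s_no, w_no))
    return index


def sim_new(sim, coef_p, coef_s, coef_w):
    """SimNew(D,Q) = Sim(D,Q) * Ceof_P * Ceof_s * Ceof_w, as in the abstract."""
    return sim * coef_p * coef_s * coef_w


def rerank(scores):
    """Order doc ids by SimNew, highest first.

    `scores` maps doc_id -> (sim, coef_p, coef_s, coef_w); how each value
    is obtained is outside this sketch.
    """
    return sorted(scores, key=lambda d: sim_new(*scores[d]), reverse=True)


# Illustrative input only; the text and scores are made up for the example.
docs = {
    "d1": "Inverted index maps terms. Terms point to documents.\n\n"
          "Retrieval uses the index."
}
index = build_index(docs)
ranking = rerank({"d1": (0.9, 0.5, 0.5, 0.5), "d2": (0.6, 1.0, 0.9, 0.8)})
```

Here `index["index"]["d1"]` holds one triple per occurrence of the term, so a coefficient function could compare the structural positions of different query terms. Because the combined score is a plain product, a document with a high traditional similarity can still drop in the ranking when its structural coefficients are low, as with `d1` in the made-up scores.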


Bibliographic details

Source
International Conference on Mechatronics and Information Technology; 5-6 December 2007; Gifu (JP) | 2007 | P.ivi | 2 pages
Conference location: Gifu (JP)
Authors
Chaotao Liu; Zushu Li;
Author affiliations

Institute of Intelligent Automation, Chongqing University, Chongqing, 400044 ,China;

School of Mechatronics and Automotive Engineering, Chongqing Jiaotong University, Chongqing, 400074, China;

Institute of Artificial Intelligence System, Chongqing Institut;

Conference organizer
Original format: PDF
Text language: eng
Chinese Library Classification: information processing;
Keywords
inverted-index; similarity; information retrieval; document structure; similarity coefficient;

Date indexed: 2022-08-26 13:52:10

