首页> 外文期刊>JISTEM - Journal of Information Systems and Technology Management >Information retrieval system using Multiwords Expressions (MWE) as descriptors
【24h】

Information retrieval system using Multiwords Expressions (MWE) as descriptors

机译:使用多词表达式(MWE)作为描述符的信息检索系统

获取原文
           

摘要

This paper aims to propose an alternative method for retrieving documents using Multiwords Expressions (MWE) extracted from a document base to be used as descriptors in search of an Information Retrieval System (IRS). In this sense, unlike methods that consider the text as a set of words, bag of words, we propose a method that takes into account the characteristics of the physical structure of the document in the extraction process of MWE. From this set of terms comparing pre-processed using an exhaustive algorithmic technique proposed by the authors with the results obtained for thirteen different measures of association statistics generated by the software Ngram Statistics Package (NSP). To perform this experiment was set up with a corpus of documents in digital format.
机译:本文旨在提出一种替代方法,该方法使用从文档库中提取的多字表达式(MWE)来检索文档,并将其用作信息检索系统(IRS)的描述符。从这个意义上讲,与将文本视为一组单词(单词袋)的方法不同,我们提出了一种在MWE的提取过程中考虑文档物理结构特征的方法。从这组术语中,比较了使用作者提出的详尽算法技术进行预处理的结果,以及使用Ngram Statistics Package(NSP)软件生成的十三种不同度量的关联统计结果。为了执行此实验,设置了一个数字格式的文档集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号