A Comparison of Recent Information Retrieval Term-Weighting Models Using Ancient Datasets

机译：使用古代数据集的最近信息检索术语加权模型的比较

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

With the development of technology, human computer interaction is continuously increasing. Parallel to this, information from web sites, social media, blogs and other applications reach enormous dimensions. It becomes a big problem to obtain the desired information from this mass of data. One way of solving this problem is to keep the information correctly indexed and searched by using information retrieval methods. Information retrieval is the study of finding documents of unstructured material which should satisfy users' information needs. Various term-weighting models have been proposed for information retrieval. This work is carried out to analyze and evaluate the retrieval effectiveness of recently developed term-weighting models (after the 2000s) using the earlier datasets (dating back as far as the 1980s) with the motivation of such comparison has not been done.The open source library Apache Lucene is used for all experiments and evaluation. As a result, we observe that the DFIC model is in general more effective than the other models. We note also that, although one model can be the most effective for one dataset, the same model can be the least effective for another dataset.

机译：随着技术的发展，人机交互不断增长。与此平行的是，来自网站，社交媒体，博客和其他应用程序的信息达到了巨大的规模。从海量数据中获得所需信息成为一个大问题。解决此问题的一种方法是通过使用信息检索方法来正确地对信息进行索引和搜索。信息检索是寻找应满足用户信息需求的非结构化材料文档的研究。已经提出了用于信息检索的各种术语加权模型。这项工作是使用较早的数据集（可追溯到1980年代）来分析和评估最近开发的术语加权模型（在2000年代之后）的检索效果的，尚未进行这种比较的动机。源代码库Apache Lucene用于所有实验和评估。结果，我们观察到DFIC模型通常比其他模型更有效。我们还注意到，尽管一个模型对于一个数据集可能是最有效的，但是同一模型对于另一个数据集可能是最无效的。

著录项

来源
《International Conference on Artificial Intelligence and Data Processing》|2018年|1-4|共4页
会议地点 Malatya(TR)
作者
Ahmet Alkılınç; Ahmet Arslan;
展开▼
作者单位

Department of Computer Engineering Eskişehir Technical University Eskişehir Turkey;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Indexing; Mathematical model; Information retrieval; Tokenization; Standards; Task analysis;

机译：索引；数学模型;信息检索；令牌化；标准；任务分析;

相似文献

外文文献
中文文献
专利

1. Improved FTIR retrieval strategy for HCFC-22 (CHClF2), comparisons with in situ and satellite datasets with the support of models, and determination of its long-term trend above Jungfraujoch [J] . Prignon Maxime, Chabrillat Simon, Minganti Daniele, Atmospheric Chemistry and Physics Discussions . 2019,第19期

机译：改进了HCFC-22（CHCLF2）的FTIR检索策略，与模型的支持和卫星数据集的比较，以及少年joch以上的长期趋势的确定
2. Improved FTIR retrieval strategy for HCFC-22 (CHClFsub2/sub), comparisons with in situ and satellite datasets with the support of models, and determination of its long-term trend above Jungfraujoch [J] . Maxime Prignon, Simon Chabrillat, Daniele Minganti, Atmospheric chemistry and physics . 2019,第19期

机译：改进了HCFC-22的FTIR检索策略（CHCLF _{2 ），与模型的支持，与卫星数据集的比较，以及确定其在Jungfraujoch之上的长期趋势}
3. An axiomatic comparison of learned term-weighting schemes in information retrieval: clarifications and extensions [J] . Ronan Cummins, Colm ORiordan Artificial Intelligence Review: An International Science and Engineering Journal . 2007,第1期

机译：信息检索中学到的术语加权方案的公理比较：说明和扩展
4. A Comparison of Recent Information Retrieval Term-Weighting Models Using Ancient Datasets [C] . Ahmet Alk?l?n?, Ahmet Arslan International Conference on Artificial Intelligence and Data Processing . 2018

机译：古代数据集最近信息检索术语加权模型的比较
5. Temperature index modeling of the Kahiltna Glacier: Comparison to multiple field and geodetic mass balance datasets. [D] . Young, Joanna C. 2013

机译：Kahiltna冰川的温度指数建模：与多场和大地质量平衡数据集的比较。
6. Variability in brain network model dynamics: comparison of neural mass models and empirical connectivity datasets in The Virtual Brain [O] . M Marmaduke Woodman, Viktor K Jirsa 2013

机译：脑网络模型动力学中的可变性：虚拟大脑中神经质量模型和经验连通性数据集的比较
7. Training deep retrieval models with noisy datasets: Bag exponential loss [O] . Tomás Martínez-Cortés, Iván González-Díaz, Fernando Díaz-de-María 2021

机译：使用嘈杂的数据集培训深度检索模型：袋指数损失
8. Evaluation of the Event Driven Phenology Model Coupled with the VegET Evapotranspiration Model Through Comparisons with Reference Datasets in a Spatially Explicit Manner [R] . Kovalskyy, V., Henebry, G. M., Adusei, B., 2011

机译：通过与空间显式方式与参考数据集进行比较，评估事件驱动的物候模型与VegET蒸发蒸腾模型的耦合

A Comparison of Recent Information Retrieval Term-Weighting Models Using Ancient Datasets

摘要

著录项

相似文献

相关主题

期刊订阅