Metrics for information retrieval: A case study


Abstract

The domain of information retrieval (IR) has made extensive use of clustering methods. Clustering is a technique that groups a set of documents into clusters, or subsets. Extracting relevant documents from the World Wide Web efficiently and effectively remains a challenging problem. In this work, we compare and analyse the effectiveness of similarity measures such as City Block distance, Cosine similarity, Point symmetry distance and the Dice coefficient for improving document clustering, with and without an ontology. The work has two objectives: to compare the metrics in the domain, and to study the impact of methods such as ontology comparison and clustering on the metrics as a whole. This should lead to further refinement of the metrics for current and future needs in the domain. Earlier works in the domain have highlighted that the similarity measures give more or less the same results. However, our work shows that ontology-based clustering produces marked changes in the results. The results show the need for more work focused on metrics in information retrieval.
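To make the measures named in the abstract concrete, the following is a minimal sketch of three of them computed on term-frequency vectors. It is an illustration under stated assumptions, not the authors' implementation: the function names and the toy vectors `doc_a` and `doc_b` are hypothetical, and the Dice coefficient uses the common vector form 2(x·y) / (|x|² + |y|²), which may differ from the variant used in the paper. Point symmetry distance is left out because it depends on a cluster centre and on details the abstract does not give.

```python
import math


def city_block(x, y):
    """City Block (Manhattan) distance: sum of absolute coordinate differences."""
    return sum(abs(a - b) for a, b in zip(x, y))


def cosine_similarity(x, y):
    """Cosine of the angle between two vectors; 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(x, y))
    norm = math.sqrt(sum(a * a for a in x)) * math.sqrt(sum(b * b for b in y))
    return dot / norm if norm else 0.0


def dice_coefficient(x, y):
    """Vector form of the Dice coefficient (an assumed variant): 2(x.y) / (|x|^2 + |y|^2)."""
    dot = sum(a * b for a, b in zip(x, y))
    denom = sum(a * a for a in x) + sum(b * b for b in y)
    return 2.0 * dot / denom if denom else 0.0


# Hypothetical term-frequency vectors for two documents over a four-term vocabulary.
doc_a = [2, 1, 0, 1]
doc_b = [1, 1, 1, 0]

if __name__ == "__main__":
    print("City Block distance:", city_block(doc_a, doc_b))
    print("Cosine similarity:  ", round(cosine_similarity(doc_a, doc_b), 4))
    print("Dice coefficient:   ", round(dice_coefficient(doc_a, doc_b), 4))
```

City Block is a distance (smaller means more alike), while the other two are similarities (larger means more alike), so a clustering routine such as K-means would need to treat them accordingly.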


Bibliographic record

Source: International Conference on Software Engineering and Mobile Application Modelling and Development, 2012, pp. 1-5 (5 pages)
Conference location: Chennai (IN)
Authors: Nadana Ravishankar T.; Shriram R.
Author affiliation: Dept. of CSE, B.S. Abdur Rahman University, Chennai, India
Original format: PDF
Language: English (eng)
Keywords: K-means; Text clustering; ontology; similarity measures

