An Effective TF/IDF-Based Text-to-Text Semantic Similarity Measure for Text Classification

机译：基于有效的TF / IDF的文本文本语义相似度，用于文本分类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The use of semantics in tasks related to information retrieval has become, in recent years, a vast field of research. Considering supervised text classification, which is the main interest of this work, semantics can be involved at different steps of text processing: during indexing step, during training step and during class prediction step. As for class prediction step, new text-to-text semantic similarity measures can replace classical similarity measures that are traditionally used by some classification methods for decision-making. In this paper we propose a new measure for assessing semantic similarity between texts based on TF/IDF with a new function that aggregates semantic similarities between concepts representing the compared text documents pair-to-pair. Experimental results demonstrate that our measure outperforms other semantic and classical measures with significant improvements.

机译：近年来，在与信息检索相关的任务中使用语义已经成为了巨大的研究领域。考虑到监督文本分类，这是这项工作的主要兴趣，语义可以涉及文本处理的不同步骤：在索引步骤期间，在训练步骤和课程预测步骤期间。至于类预测步骤，新的文本语义相似度措施可以替换传统上由某些分类方法用于决策的经典相似性测量。在本文中，我们提出了一种新的措施，用于评估基于TF / IDF的文本之间的语义相似性，具有汇总表示比较文档对对对的概念之间的语义相似性的新函数。实验结果表明，我们的衡量越来越优于其他语义和经典措施，具有显着的改进。

著录项

来源
《International Conference on Web Information Systems Engineering》|2014年||共10页
会议地点
作者
Shereen Albitar; Sebastien Fournier; Bernard Espinasse;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP393-53;
关键词
Classification; Semantics; Text-to-Text Semantic Similarity;

机译：分类;语义;文本文本语义相似性;

相似文献

外文文献
中文文献
专利

1. EFFECTIVE SEMANTIC TEXT SIMILARITY METRIC USING NORMALIZED ROOT MEAN SCALED SQUARE ERROR [J] . ISSA ATOUM, MARUTHI ROHIT AYYAGARI Journal of Theoretical and Applied Information Technology . 2019,第12期

机译：使用归一化均方根平方误差的有效语义文本相似度度量
2. Measuring the short text similarity based on semantic and syntactic information [J] . Jiaqi Yang, Yongjun Li, Congjie Gao, Future generation computer systems . 2021,第Jana期

机译：基于语义和句法信息测量短文本相似性
3. A Comparison of Approaches for Measuring the Semantic Similarity of Short Texts Based on Word Embeddings [J] . Karlo Babi?, Francesco Guerra, Sanda Martin?i?-Ip?i?, Journal of Information and Organizational Sciences . 2020,第2期

机译：基于Word Embeddings测量短文本语义相似性的方法的比较
4. An Effective TF/IDF-Based Text-to-Text Semantic Similarity Measure for Text Classification [C] . Shereen Albitar, Sebastien Fournier, Bernard Espinasse International conference on web information systems engineering . 2014

机译：一种有效的基于TF / IDF的文本到文本语义相似度度量用于文本分类
5. An Automatic Similarity Detection Engine Between Sacred Texts Using Text Mining and Similarity Measures [D] . Qahl, Salha Hassan Muhammed. 2014

机译：使用文本挖掘和相似度度量的神圣文本之间的自动相似度检测引擎
6. Evaluating measures of semantic similarity and relatedness to disambiguate terms in biomedical text [O] . Bridget McInnes, Ted Pedersen -1

机译：评价语义相似性和相关性的措施以消除生物医学文本中的歧义术语
7. Text-to-text semantic similarity for automatic short answer grading [O] . Michael Mohler, Rada Mihalcea 2009

机译：None

An Effective TF/IDF-Based Text-to-Text Semantic Similarity Measure for Text Classification

摘要

著录项

相似文献

相关主题

期刊订阅