An Effective TF/IDF-Based Text-to-Text Semantic Similarity Measure for Text Classification

机译：一种有效的基于TF / IDF的文本到文本语义相似度度量用于文本分类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The use of semantics in tasks related to information retrieval has become, in recent years, a vast field of research. Considering supervised text classification, which is the main interest of this work, semantics can be involved at different steps of text processing: during indexing step, during training step and during class prediction step. As for class prediction step, new text-to-text semantic similarity measures can replace classical similarity measures that are traditionally used by some classification methods for decision-making. In this paper we propose a new measure for assessing semantic similarity between texts based on TF/IDF with a new function that aggregates semantic similarities between concepts representing the compared text documents pair-to-pair. Experimental results demonstrate that our measure outperforms other semantic and classical measures with significant improvements.

机译：近年来，在与信息检索相关的任务中使用语义已成为一个广阔的研究领域。考虑到监督文本分类是这项工作的主要目的，语义可以涉及文本处理的不同步骤：在索引步骤，训练步骤和班级预测步骤中。至于类预测步骤，新的文本到文本语义相似性度量可以代替一些分类方法传统上用于决策的经典相似性度量。在本文中，我们提出了一种新的方法，该方法用于评估基于TF / IDF的文本之间的语义相似性，该新功能可以汇总表示成对文本的比较文本文档的概念之间的语义相似性。实验结果表明，我们的方法在显着改进方面优于其他语义和经典方法。

著录项

来源
《International conference on web information systems engineering》|2014年|105-114|共10页
会议地点
作者
Shereen Albitar; Sebastien Fournier; Bernard Espinasse;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Classification; Semantics; Text-to-Text Semantic Similarity;

机译：分类;语义学文本到文本的语义相似度;

相似文献

外文文献
中文文献
专利

1. EFFECTIVE SEMANTIC TEXT SIMILARITY METRIC USING NORMALIZED ROOT MEAN SCALED SQUARE ERROR [J] . ISSA ATOUM, MARUTHI ROHIT AYYAGARI Journal of Theoretical and Applied Information Technology . 2019,第12期

机译：使用归一化均方根平方误差的有效语义文本相似度度量
2. Measuring the short text similarity based on semantic and syntactic information [J] . Jiaqi Yang, Yongjun Li, Congjie Gao, Future generation computer systems . 2021,第Jana期

机译：基于语义和句法信息测量短文本相似性
3. A Comparison of Approaches for Measuring the Semantic Similarity of Short Texts Based on Word Embeddings [J] . Karlo Babi?, Francesco Guerra, Sanda Martin?i?-Ip?i?, Journal of Information and Organizational Sciences . 2020,第2期

机译：基于Word Embeddings测量短文本语义相似性的方法的比较
4. An Effective TF/IDF-Based Text-to-Text Semantic Similarity Measure for Text Classification [C] . Shereen Albitar, Sebastien Fournier, Bernard Espinasse International Conference on Web Information Systems Engineering . 2014

机译：基于有效的TF / IDF的文本文本语义相似度，用于文本分类
5. An Automatic Similarity Detection Engine Between Sacred Texts Using Text Mining and Similarity Measures [D] . Qahl, Salha Hassan Muhammed. 2014

机译：使用文本挖掘和相似度度量的神圣文本之间的自动相似度检测引擎
6. Evaluating measures of semantic similarity and relatedness to disambiguate terms in biomedical text [O] . Bridget McInnes, Ted Pedersen -1

机译：评价语义相似性和相关性的措施以消除生物医学文本中的歧义术语
7. Text-to-text semantic similarity for automatic short answer grading [O] . Michael Mohler, Rada Mihalcea 2009

机译：None

An Effective TF/IDF-Based Text-to-Text Semantic Similarity Measure for Text Classification

摘要

著录项

相似文献

相关主题

期刊订阅