Consensus Similarity Measure for Short Text Clustering

机译：短文本聚类的共识相似性度量

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Measuring semantic similarity between short texts is challenging because the meaning of short texts may vary dramatically even by a few words due to their limited lengths. In this paper, we propose a novel similarity measure for terms that allows better clustering performance than the state-of-the-art method. To achieve such performance, we incorporate knowledge-based and corpus-based term similarity measures in order to exploit advantages of both approaches. We apply our method to a dialog-utterance dataset, which consists of short dialog texts. Empirical study shows that the proposed method outperforms one of the state-of-the-art clustering algorithms for short text clustering.

机译：测量短文本之间的语义相似性具有挑战性，因为由于长度有限，短文本的含义可能会发生巨大变化，即使只有几个词也是如此。在本文中，我们为术语提出了一种新颖的相似性度量，与最新技术方法相比，该度量具有更好的聚类性能。为了实现这种性能，我们结合了基于知识和基于语料库的术语相似性度量，以便利用两种方法的优势。我们将我们的方法应用于由简短对话文本组成的对话话语数据集。实证研究表明，所提出的方法优于短文本聚类的最新聚类算法之一。

著录项

来源
《International workshop on database and expert systems applications》|2015年|264-268|共5页
会议地点
作者
Youhyun Shin; Yeonchan Ahn; Heesik Jeon; Sang-goo Lee;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
clustering; semantic similarity; short text;

机译：聚类;语义相似度;短文本;

相似文献

外文文献
中文文献
专利

1. BTM and GloVe Similarity Linear Fusion-Based Short Text Clustering Algorithm for Microblog Hot Topic Discovery [J] . Wu Di, Zhang Mengtian, Shen Chao, Quality Control, Transactions . 2020,第期

机译：基于BTM和手套相似性线性融合的微博热门主题发现的简短文本聚类算法
2. An Improved Similarity Matching based Clustering Framework for Short and Sentence Level Text [J] . M. John Basha, K.P. Kaliyamurthie International Journal of Electrical and Computer Engineering . 2017,第1期

机译：一种改进的基于相似度匹配的短句子级文本聚类框架
3. Measuring the short text similarity based on semantic and syntactic information [J] . Jiaqi Yang, Yongjun Li, Congjie Gao, Future generation computer systems . 2021,第Jana期

机译：基于语义和句法信息测量短文本相似性
4. Consensus Similarity Measure for Short Text Clustering [C] . Youhyun Shin, Yeonchan Ahn, Heesik Jeon, International workshop on database and expert systems applications . 2015

机译：短文本群集的共识相似度测量
5. An Automatic Similarity Detection Engine Between Sacred Texts Using Text Mining and Similarity Measures [D] . Qahl, Salha Hassan Muhammed. 2014

机译：使用文本挖掘和相似度度量的神圣文本之间的自动相似度检测引擎
6. GO functional similarity clustering depends on similarity measure clustering method and annotation completeness [O] . Meng Liu, Paul D. Thomas 2019

机译：GO功能相似性聚类取决于相似性度量聚类方法和注释完整性
7. FUSE (Fuzzy Similarity Measure) - A measure for determining fuzzy short text similarity using Interval Type-2 fuzzy sets [O] . Naeemeh Adel, Keeley Crockett, Alan Crispin, 2018

机译：熔断器（模糊相似度测量） - 使用间隔类型-2模糊集确定模糊短文本相似度的度量

Consensus Similarity Measure for Short Text Clustering

摘要

著录项

相似文献

相关主题

期刊订阅