A novel method for clustering tweets in Twitter

Shanmugam Poomagal; Palanisamy Visalakshi; Thiagarajan Hamsapriya

首页> 外文期刊>International Journal of Web Based Communities >A novel method for clustering tweets in Twitter

【24h】

A novel method for clustering tweets in Twitter

机译：一种在Twitter中对推文进行聚类的新方法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

A popular social networking service called Twitter is used to post short messages that could be useful to someone in the world. These messages have been analysed by the researchers in different ways. This paper proposes a clustering technique to cluster the tweets in the Twitter. The basic aim of performing this clustering is to identify the groups of similar tweets posted and this information is useful to identify various user communities. These user communities can be recommended to the advertisers in Twitter by matching their topic of interest with the advertisers' field. Suffix Tree Clustering (STC) algorithm is the core web documents clustering algorithm which groups similar documents into clusters by constructing suffix tree. We used STC along with semantic similarity among the posted tweets to identify the topics of interest. The proposed method is compared with STC and Lingo algorithms using intra-cluster distance and inter-cluster distance. Results show that the proposed method performs better than the existing methods with 10.59% reduction in the intra-cluster distance value and 44.99% increase in the inter-cluster distance value.

机译：一种流行的社交网络服务，称为Twitter，用于发布可能对世界各地的人有用的短消息。研究人员已以不同方式分析了这些消息。本文提出了一种将Twitter中的推文进行聚类的聚类技术。执行此群集的基本目的是识别发布的类似推文的组，并且此信息对于识别各种用户社区很有用。通过将他们感兴趣的主题与广告商的字段进行匹配，可以向Twitter中的广告商推荐这些用户社区。后缀树聚类（STC）算法是核心的Web文档聚类算法，通过构造后缀树将相似的文档分为几类。我们使用STC以及已发布推文之间的语义相似性来识别感兴趣的主题。将所提出的方法与使用群集内距离和群集间距离的STC和Lingo算法进行了比较。结果表明，所提出的方法比现有方法具有更好的性能，集群内距离值减少了10.59％，集群间距离值增加了44.99％。

著录项

来源
《International Journal of Web Based Communities》 |2015年第2期|170-187|共18页
作者
Shanmugam Poomagal; Palanisamy Visalakshi; Thiagarajan Hamsapriya;
展开▼
作者单位

Department of Applied Mathematics and Computational Sciences, PSG College of Technology, Coimbatore, Tamil Nadu, India;

Department of Electronics and Communication Engineering, PSG College of Technology, Coimbatore, Tamil Nadu, India;

Oriental Institute of Science and Technology, Bhopal, India;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Twitter; tweets; semantic similarity; suffix tree clustering; STC; Lingo; inter-cluster distance; intra-cluster distance;

机译：推特;推文语义相似度;后缀树聚类;STC;林戈集群间距离;集群内距离;

相似文献

外文文献
中文文献
专利

1. Peringkasan Tweet Berdasarkan Trending Topic Twitter Dengan Pembobotan TF-IDF dan Single Linkage AngglomerativeHierarchical Clustering [J] . Annisa Annisa, Yuda Munarko, Yufis Azhar Kinetik . 2016,第1期

机译：基于带有加权TF-IDF和单一链接盎格鲁式分层聚类的Twitter趋势主题的推文摘要
2. Tweeting Apart: Applying Network Analysis to Detect Selective Exposure Clusters in Twitter [J] . Itai Himelboim, Marc Smith, Ben Shneiderman Communication Methods and Measures . 2013,第3a4期

机译：分开发推文：应用网络分析来检测Twitter中的选择性暴露群
3. I Tweet, You Tweet, (S)He Tweets: Enhancing the ESL Language-Learning Experience Through Twitter [J] . Geraldine Blattner, Amanda Dalola International Journal of Computer-Assisted Language Learning and Teaching . 2018,第2期

机译：我鸣叫，你鸣叫，（S）他鸣叫：通过Twitter增强ESL语言学习体验
4. Tweet Cluster Analyzer: Partition and Join-based Micro-clustering for Twitter Data Stream [C] . M. Arun Manicka Raja, S. Swamynathan International Conference on "Computational intelligence in Data Mining" . 2017

机译：Tweet Cluster Analyzer：Twitter数据流的分区和基于加入的微群
5. The Marcellus Shale in Maryland and Twitter: A Mixed Methods Analysis of Tweets from November 2016 [D] . Breitenother, Allison Gost. 2017

机译：马里兰和Twitter上的Marcellus页岩：2016年11月以来推文的混合方法分析
6. Sentiment Analysis of Shared Tweets on Global Warming on Twitter with Data Mining Methods: A Case Study on Turkish Language [O] . Yasin Kirelli, Seher Arslankaya 2020

机译：数据采矿方法全球变暖的共享推文的情感分析 - 以土耳其语为例
7. Sentiment Analysis of Shared Tweets on Global Warming on Twitter with Data Mining Methods: A Case Study on Turkish Language [O] . Yasin Kirelli, Seher Arslankaya 2020

机译：数据采矿方法全球变暖的共享推文的情感分析 - 以土耳其语为例

A novel method for clustering tweets in Twitter

摘要

著录项

相似文献

相关主题

期刊订阅