首页> 外文会议>nternational Conference on Reliability, Infocom Technologies and Optimization >Topic detection using BNgram method and sentiment analysis on twitter dataset
【24h】

Topic detection using BNgram method and sentiment analysis on twitter dataset

机译:使用BNgram方法进行主题检测以及对Twitter数据集进行情感分析

获取原文

摘要

Online social and news media has become a very popular for users to share their opinions. It generates rich and timely information about actual world actions of all types. Several efforts were dedicated for mining topics, sentiments and opinions automatically from natural language in news, social media messages, and commercial reviews of product and services. Social media like facebook, twitter, online review websites like Amazon are popular sites where millions of users exchange their opinions and making it a valuable platform for tracking and analyzing trending topics and sentiments. This provides important information for decision making in various domains. An enormous amount of available data requires information filtering for drilling down the relevant topics and events. Topic detection is the solution for monitoring and summarizing information generates from social sources. Various topic detection methods are available which affect the quality of result. In this paper we use BNgram which is one of the novel topic detection methods on three large Twitter datasets associated to recent events. It has been observed that the pre-processing of the data and sampling procedure are greatly affecting the quality of detected topics. On much focused topics, standard NLP techniques can do well for social streams. But for handling more heterogeneous streams novel techniques are used. BNgram method gives the best performance, thus being more reliable. In this paper we also find the sentiments of people related to events. “Sentiwordnet dictionary” is used for finding scores of each word. And then sentiments are classified as “negative, positive and neutral”.
机译:在线社交和新闻媒体已成为用户分享意见的一种非常流行的方式。它生成有关各种类型的现实世界行为的丰富及时的信息。为了从新闻,社交媒体消息以及产品和服务的商业评论中的自然语言中自动挖掘主题,情感和观点,我们付出了一些努力。社交媒体(如facebook,twitter)和在线评论网站(如亚马逊)是受欢迎的网站,数百万用户在此交换意见,并使其成为跟踪和分析趋势主题和情绪的宝贵平台。这为各个领域的决策提供了重要信息。大量可用数据需要信息过滤以深入研究相关主题和事件。主题检测是监视和汇总从社交来源生成的信息的解决方案。可以使用各种影响结果质量的主题检测方法。在本文中,我们使用BNgram,这是对与最近事件相关的三个大型Twitter数据集的新颖主题检测方法之一。已经观察到,数据的预处理和采样过程极大地影响了所检测主题的质量。在重点突出的主题上,标准的NLP技术可以很好地处理社交流。但是为了处理更多的异构流,使用了新技术。 BNgram方法具有最佳性能,因此更可靠。在本文中,我们还发现了与事件相关的人的情绪。 “ Sentiwordnet词典”用于查找每个单词的分数。然后,情绪被分类为“负面,正面和中立”。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号