首页> 外文会议>International Conference on Digital Information Management >Mining twitterspace for information: Classifying sentiments programmatically using Java
【24h】

Mining twitterspace for information: Classifying sentiments programmatically using Java

机译:挖掘TwitterSpace了解信息:使用Java以编程方式进行分类情绪

获取原文

摘要

People increasingly use Twitter to share advice, opinions, news, moods, concerns, facts, rumors, and everything else imaginable. Much of that data is public and available for mining. However, classifying automatically the sentiment of the Twitter messages into either positive or negative with respect to a query term represents a new research challenge. Variety of approaches that use natural language and statistical techniques failed to report high accuracy of tweets classification due to the nature of these tweets containing large number of abbreviations, emoticons and ill structured grammar. In this article we are presenting a programming approach that uses the Weka data mining APIs to classify tweets. Using this programming approach we can experiment on how to train the classifiers and determine which one is more effective than the others. In our experiments, the K* classifier is found to report a high degree of accuracy in tweets classification.
机译:人们越来越多地利用Twitter分享咨询,意见,新闻,情绪,关注,事实,谣言,以及其他可想象的一切。 大部分数据都是公开的,可用于采矿。 但是,在查询期间自动对Twitter消息的情绪分类为正或否定表示新的研究挑战。 由于这些推文的性质,使用自然语言和统计技术使用自然语言和统计技术的各种方法未能报告含有大量缩写,表情符号和患病的语法的这些推文的性质。 在本文中,我们正在提出一种编程方法,它使用Weka数据挖掘API对推文进行分类。 使用这种编程方法,我们可以试验如何培训分类器并确定哪一个比其他方式更有效。 在我们的实验中,发现K *分类器在推文分类中报告了高精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号