首页> 外文会议>International conference on advances in computing, communications and informatics >Twitter sentiment classification using machine learning techniques for stock markets
【24h】

Twitter sentiment classification using machine learning techniques for stock markets

机译:使用机器学习技术对股票市场进行Twitter情感分类

获取原文

摘要

Sentiment classification of Twitter data has been successfully applied in finding predictions in a variety of domains. However, using sentiment classification to predict stock market variables is still challenging and ongoing research. The main objective of this study is to compare the overall accuracy of two machine learning techniques (logistic regression and neural network) with respect to providing a positive, negative and neutral sentiment for stock-related tweets. Both classifiers are compared using Bigram term frequency (TF) and Unigram term frequency - inverse document term frequency (TF-IDF) weighting schemes. Classifiers are trained using a dataset that contains 42,000 automatically annotated tweets. The training dataset forms positive, negative and neutral tweets covering four technology-related stocks (Twitter, Google, Facebook, and Tesla) collected using Twitter Search API. Classifiers give the same results in terms of overall accuracy (58%). However, empirical experiments show that using Unigram TF-IDF outperforms TF.
机译:Twitter数据的情感分类已成功应用于各种领域的预测中。但是,使用情绪分类来预测股票市场变量仍然具有挑战性并且正在进行研究。这项研究的主要目的是比较两种机器学习技术(逻辑回归和神经网络)的总体准确性,以提供与股票相关的推文的正面,负面和中性情绪。使用Bigram词频(TF)和Unigram词频-逆文档词频(TF-IDF)加权方案对这两个分类器进行比较。使用包含42,000个自动注释的推文的数据集训练分类器。训练数据集形成正面,负面和中立的推文,涵盖使用Twitter搜索API收集的四种与技术相关的股票(Twitter,Google,Facebook和Tesla)。分类器在总体准确度方面给出相同的结果(58%)。但是,经验实验表明,使用Unigram TF-IDF优于TF。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号