【24h】

Efficient and Parallel Framework for Analyzing the Sentiment

机译:用于分析情绪的高效和平行框架

获取原文

摘要

With the advent of Web 2.0, user-generated content is led to an explosion of data on the Internet. Several platforms such as social networking, microblogging, and picture sharing exist that allow users to express their views on almost any topic. The user views express their emotions and sentiments on products, services, any action by governments, etc. Sentiment analysis allows quantifying popular mood on any product, service or an idea. Twitter is popular microblogging platform, which permits users to express their views in a very concise manner. In this paper, a new framework is crafted which carried out the entire chain of tasks starting with extraction of tweets to presenting the results in multiple formats using an ETL (Extract, Transform, and Load) big data tool called Talend. The framework includes a technique to quantify sentiment in a Twitter stream by normalizing the text and judge the polarity of textual data as positive, negative, or neutral. The technique addresses peculiarities of Twitter communication to enhance accuracy. The technique gives an accuracy of above 84% on standard datasets.
机译:随着Web 2.0的出现,用户生成的内容被导致互联网上的数据爆炸。存在若干平台,如社交网络,微博和图片共享,允许用户对几乎任何主题表达他们的视图。用户观点表达了他们的情感和对各国政府的任何行动的情感和情绪等。情绪分析允许量化任何产品,服务或想法的流行情绪。 Twitter是流行的微博平台,允许用户以非常简洁的方式表达他们的观点。在本文中,制作了一个新的框架,它从提取推文开始的整个任务链,以使用名为talend的ETL(提取,变换和负载)大数据工具以多种格式呈现结果。该框架包括通过归一化文本来量化Twitter流中的情绪的技术,并判断文本数据的极性,为正,负或中性。该技术解决了Twitter通信的特点,以提高准确性。该技术在标准数据集中提供了高于84%的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号