...
首页> 外文期刊>American journal of engineering and applied sciences >A STING Algorithm and Multi-dimensional Vectors Used for English Sentiment Classification in a Distributed System
【24h】

A STING Algorithm and Multi-dimensional Vectors Used for English Sentiment Classification in a Distributed System

机译:分布式系统英语情感分类的STING算法和多维向量

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Sentiment classification is significant in everyday life, such as in political activities, commodity production and commercial activities. Finding a fast, highly accurate solution to classify emotion has been a challenge for scientists. In this research, we have proposed a new model for Big Data sentiment classification in the parallel network environment - a Cloudera system with Hadoop Map (M) and Hadoop Reduce (R). Our new model has used a Statistical Information Grid Algorithm (STING) with multi-dimensional vector and 2,000,000 English documents of our English training data set for English document-level sentiment classification. Our new model can classify sentiment of millions of English documents based on many English documents in the parallel network environment. However, we tested our new model on our testing data set (including 1,000,000 English reviews, 500,000 positive and 500,000 negative) and achieved 83.92% accuracy.
机译:情感分类在日常生活中很重要,例如在政治活动,商品生产和商业活动中。寻找一种快速,高度准确的解决方案来对情绪进行分类一直是科学家的挑战。在这项研究中,我们为并行网络环境中的大数据情感分类提出了一个新模型-具有Hadoop Map(M)和Hadoop Reduce(R)的Cloudera系统。我们的新模型使用了带有多维矢量的统计信息网格算法(STING)和我们英语培训数据集中的2,000,000个英语文档,用于英语文档级情感分类。我们的新模型可以基于并行网络环境中的许多英语文档对数百万个英语文档的情感进行分类。但是,我们在测试数据集上测试了新模型(包括1,000,000个英语评论,500,000个肯定和500,000个否定),并且达到了83.92%的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号