首页> 外文期刊>International Journal of Information Management >Towards a big data framework for analyzing social media content
【24h】

Towards a big data framework for analyzing social media content

机译:建立大数据框架以分析社交媒体内容

获取原文
获取原文并翻译 | 示例
           

摘要

Modern companies generate value by digitalizing their services and products. Knowing what customers are saying about the firm through reviews in social media content constitutes a key factor to succeed in the big data era. However, social media data analysis is a complex discipline due to the subjectivity in text review and the additional features in raw data. Some frameworks proposed in the existing literature involve many steps that thereby increase their complexity. A two-stage framework to tackle this problem is proposed: the first stage is focused on data preparation and finding an optimal machine learning model for this data; the second stage relies on established layers of big data architectures focused on getting an outcome of data by taking most of the machine learning model of stage one. Thus, a first stage is proposed to analyze big and small datasets in a non-big data environment, whereas the second stage analyzes big datasets by applying the first stage machine learning model of. Then, a study case is presented for the first stage of the framework to analyze reviews of hotel-related businesses. Several machine learning algorithms were trained for two, three and five classes, with the best results being found for binary classification.
机译:现代公司通过数字化其服务和产品来创造价值。通过对社交媒体内容的评论了解客户对公司的评价,是在大数据时代取得成功的关键因素。但是,由于文本审阅的主观性和原始数据的其他功能,社交媒体数据分析是一门复杂的学科。现有文献中提出的一些框架涉及许多步骤,从而增加了它们的复杂性。提出了一个解决此问题的两阶段框架:第一阶段的重点是数据准备和为此数据寻找最佳的机器学习模型;第二阶段依赖于已建立的大数据架构层,这些层致力于通过采用第一阶段的大多数机器学习模型来获取数据结果。因此,提出了第一阶段来分析非大数据环境中的大型和小型数据集,而第二阶段则通过应用的第一阶段机器学习模型来分析大型数据集。然后,针对该框架的第一阶段提出了一个研究案例,以分析与酒店相关的业务的评论。对几种机器学习算法进行了两,三和五类的训练,对于二进制分类,发现了最好的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号