首页> 外文会议>International Conference on Systems and Informatics >Analysis of dengue outbreaks using big data analytics and social networks
【24h】

Analysis of dengue outbreaks using big data analytics and social networks

机译:使用大数据分析和社交网络分析登革热暴发

获取原文

摘要

The term Big Data can be defined as analysis a large volume of data in unstructured databases, organizations of different segment and size, has been employing the Big Data Analytics philosophy as support tool strategic to anticipate valuable insights and trends on the behavior of consumers and their expectations, thus gaining a competitive advantage in the market in which they operate. However, to extract information from values as the goal of turning that volume of data into predictive information or insights is still a big challenge in Big Data. The main objective of this work is to present an implementation of a Big Data project, using data originated from social networks, as well as text mining techniques and machine learning, through the implementation of K-Means and SVM algorithms, with intention of identifying patterns from dengue outbreaks, through analyzes that show insights of probable outbreaks of dengue in a particular region of Brazil. The results obtained indicate that the implemented project had a satisfactory yield if compared of the data collected of the Ministry of Health of Brazil, thus indicating a potential for utilization of its purpose. It is observed that the main advantage of the analyzes in Big Data is related to the possibility of use the unstructured data that can be obtained in social networks, e-commerce sites, among others and structured data obtained from traditional databases and from this union, to extract information from values that can be used to benefit of organizations, thus allowing know the future behaviors and thereby act in a preventive way.
机译:大数据一词可以定义为分析非结构化数据库,不同细分市场和规模的组织中的大量数据,并一直采用大数据分析哲学作为支持工具,以战略性地预测有关消费者及其行为的有价值的见解和趋势。期望,从而在他们经营的市场中获得竞争优势。但是,从值中提取信息作为将大量数据转换为预测性信息或洞察力的目标仍然是大数据中的一大挑战。这项工作的主要目的是通过实施K-Means和SVM算法,利用社交网络中的数据以及文本挖掘技术和机器学习,提出一个大数据项目的实施方案,以期确定模式从登革热暴发到分析,这些分析显示了在巴西特定地区可能爆发的登革热的见解。所获得的结果表明,如果与巴西卫生部收集的数据进行比较,则该实施项目的收成令人满意,从而表明了利用其目的的潜力。可以看出,大数据分析的主要优势与以下可能性有关:使用可以在社交网络,电子商务站点等中获得的非结构化数据,以及从传统数据库和该联合会获得的结构化数据,从可用于组织受益的价值中提取信息,从而允许了解未来的行为,从而采取预防措施。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号