首页> 外文会议>Conference on Technologies and Applications of Artificial Intelligence >Tutorial IV computational intelligence for data analytics
【24h】

Tutorial IV computational intelligence for data analytics

机译:教程IV计算智能,用于数据分析

获取原文

摘要

Humankind has been collecting data since the recording started, but in the last decade with the considerable advances in computing and storage technologies, advancements of cloud computing, development of ubiquitous connectivity and the internet of things, there has been explosion in the size and variety of collected data. Nevertheless, one can be data-rich and knowledge-poor, and this is where the data analytics and the development and application of machine learning models become necessity for gaining insight of complex processes to prove scientific theories and discoveries, support decision making and enhance strategic planning in different areas of the economy, finance, industry, healthcare, etc. Recently, there is an influx of polymorphic, unstructured and multimodal data - social media, images, audio, video, etc., which is complicating further the data processing and knowledge extraction process. But even the traditional structured datasets present problems that need to be addressed and overcome in the early stages of data pre-processing, feature extraction and feature selection. This is because they usually contain variety of data formats, e.g., categorical, continuous, ordinal, and frequently missing data (usually result of sensors faults, human errors, collection, transportation, or storage problems). The most popular approaches in dealing with missing data generally fall in three groups: Deletion methods; Single imputation methods; and Model-based methods [1].
机译:自录制开始以来,人类一直在收集数据,但是在过去的十年中,随着计算和存储技术的巨大进步,云计算的进步,无处不在的连接性和物联网的发展,...收集的数据。然而,数据可能丰富而知识匮乏,这就是数据分析以及机器学习模型的开发和应用成为获取复杂过程的洞察力以证明科学理论和发现,支持决策并增强战略性的必要条件。最近,涌现了多态,非结构化和多模式数据-社交媒体,图像,音频,视频等,这进一步使数据处理和处理变得更加复杂。知识提取过程。但是,即使是传统的结构化数据集,也存在着在数据预处理,特征提取和特征选择的早期阶段需要解决和克服的问题。这是因为它们通常包含各种数据格式,例如,分类,连续,序数和经常丢失的数据(通常是传感器故障,人为错误,收集,运输或存储问题的结果)。处理丢失数据的最流行方法通常分为三类:删除方法;单一插补方法;和基于模型的方法[1]。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号