【24h】

Structured and Unstructured Big Data Analytics

机译:结构化和非结构化大数据分析

获取原文

摘要

The volume of data in the world is growing very fast and generated from verity of sources like social media, sensors airline industry or scientific data in different formats. Biggest challenge is how to infer meaningful insights from such a varietyful and big data along with concern of data storage and management of fast growing data. The size of the databases used in today’s enterprises has been growing at exponential rates day by day. Hence, industries requirement to quickly process and analyze the big data volumes for business decision making and customer insights has also grown exponentially. Data pouring from various sources may be can be structured or unstructured in nature. Structured data refers to a relatively well-organized information, which can be further inserted into traditional RDBMS. As Traditional RDBMS are efficient and easy queries by simple, straightforward search algorithms or SQL queries. In contrast to structured data, unstructured data can be considered as information, which does not, comes in a pre-defined data format, well organized data storage model, or cannot be stored well into relational tables. It is assumed to be fastest growing type of data, e.g. image, sensors data, web chats, social networking messaging data, video, documents, log files, and email data. There are many techniques and software available, which can process and provide efficient storage of unstructured data and help organization to perform analytics on unstructured data. Unstructured data does not well-organized and not stored in predefined manner e.g. logs, web chats. The variety and on ordered nature of data makes storage methods and structure makes execution a time and resource-consuming affair. Advancement into technology has open floodgates to push huge volume of unstructured type of data. Multimedia data is one of the example of unstructured big data, which spans all over the Internet. This needs high execution capability to extract useful information. Rapid processing of multimedia data such as video is important for e.g. criminal investigations, surveillance monitoring, news analysis, sports analytics domain, emotion extraction, etc. Hence, analysis of multimedia data in minimum timeframe is one of the latest research areas. Therefore, we have researched techniques for analyzing unstructured data to extract meaningful information hidden in the big data. In addition, we will describe about various techniques and software used to Manage, process unstructured big data in efficient manner, and increases the performance of complexity analysis.
机译:世界上数据的数量正在增长非常快,从社交媒体等资源,传感器的符合性,传感器,传感器,以不同的格式。最大的挑战是如何从这些品种和大数据中推断出有意义的见解以及数据存储和快速增长数据的管理。今天的企业中使用的数据库的大小在日复一日的指数利率上成长。因此,行业要求快速处理和分析业务决策和客户洞察的大数据量也呈指数增长。从各种来源浇注的数据可以在自然界中结构或非结构化。结构化数据是指相对良好的组织信息,可以进一步插入传统的RDBMS中。由于传统的RDBMS通过简单,简单的搜索算法或SQL查询是高效且简单的查询。与结构化数据相比,非结构化数据可以被视为没有的信息,这些信息以预定义的数据格式,井有组织的数据存储模型,或者不能存储在关系表中。假设是增长最快的数据类型,例如,图像,传感器数据,网络聊天,社交网络消息数据,视频,文档,日志文件和电子邮件数据。有许多技术和软件可用,可以处理和提供非结构化数据和帮助组织的有效存储,以在非结构化数据上执行分析。非结构化数据没有很好地组织,而不是以预定义的方式存储。日志,网页聊天。数据的各种和有序性质使得存储方法和结构使执行时间和资源消耗。进入技术的进步已开放闸门,以推动大量的非结构化类型数据。多媒体数据是非结构化大数据的例子之一,它在互联网上跨越。这需要高执行能力来提取有用的信息。诸如视频之类的多媒体数据的快速处理对于例如,这是重要的。刑事调查,监测监测,新闻分析,体育分析领域,情感提取等,最小时间范围内的多媒体数据是最新的研究领域之一。因此,我们研究了用于分析非结构化数据的技术,以提取隐藏在大数据中的有意义信息。此外,我们将描述用于管理的各种技术和软件,以有效的方式处理非结构化的大数据,并提高复杂性分析的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号