首页> 外文会议>2017 IEEE 3rd International Conference on Engineering Technologies and Social Sciences >Filtering the big data based on volume, variety and velocity by using Kalman filter recursive approach
【24h】

Filtering the big data based on volume, variety and velocity by using Kalman filter recursive approach

机译:使用Kalman滤波器递归方法基于数​​量,种类和速度过滤大数据

获取原文
获取原文并翻译 | 示例

摘要

For the past seven decades the term Big Data is known, but due to the emerging technology shift of this era, it is captivating a lot of attention from the researchers of mathematics, computing, telecommunication, information technology, data warehousing, and mining. As this generation is living in the age of technology where data is playing a vital role and especially the Big Data has lots of success stories, but at the same time it is becoming the biggest threat to network service provider, telecom industry, and homeland security. Every device such as smart phones, laptop, desktop, etc. connected with the network is contributing to add data to a Big Data pool by using different applications. Social media such as Instagram, Facebook, WhatsApp, Apple, Google, Google+, Twitter, Flickr, etc. are few famous tools which are used to add redundant data. The question appears, is it mandatory to store and especially process all the data either useful or redundant? This research paper is focusing for filtering useful data from redundant data by using their parameters which are velocity, variety, and volume. In proposed architecture, Memcache DB (for velocity), Voldemort layers (for variety) and MapReduce (for volume) are linked with Hadoop to achieve filtered data. Kalman filter recursive approach is used to inject the data back into Hadoop Distributed File System to reduce processing cost of next iterations.
机译:在过去的七十年中,“大数据”一词广为人知,但是由于这个时代的新兴技术转变,它吸引了数学,计算,电信,信息技术,数据仓库和采矿等领域的研究人员的极大关注。随着这一代人生活在技术时代,数据起着至关重要的作用,尤其是大数据有许多成功的故事,但与此同时,它正成为对网络服务提供商,电信行业和国土安全的最大威胁。与网络连接的每台设备(例如智能手机,笔记本电脑,台式机等)都通过使用不同的应用程序来向大数据池添加数据。诸如Instagram,Facebook,WhatsApp,Apple,Google,Google +,Twitter,Flickr等社交媒体是少数用于添加冗余数据的著名工具。出现问题了,是否必须存储(尤其是处理所有有用或冗余的数据)?该研究论文的重点是通过使用冗余参数(速度,种类和体积)从冗余数据中过滤出有用数据。在建议的体系结构中,Memcache DB(用于速度),Voldemort层(用于变化)和MapReduce(用于体积)与Hadoop链接以实现过滤数据。 Kalman过滤器递归方法用于将数据注入回Hadoop分布式文件系统,以减少下一次迭代的处理成本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号