【24h】

Signal processing for Big Data

机译:大数据信号处理

获取原文

摘要

Summary only from given. We live in an era of data deluge. Pervasive sensors collect massive amounts of information on every bit of our lives, churning out enormous streams of raw data in various formats. Mining information from unprecedented volumes of data promises to limit the spread of epidemics and diseases, identify trends in financial markets, learn the dynamics of emergent social-computational systems, and also protect critical infrastructure including the smart grid and the Internet's backbone network. While Big Data can be definitely perceived as a big blessing, big challenges also arise with large-scale datasets. The sheer volume of data makes it often impossible to run analytics using a central processor and storage, and distributed processing with parallelized multi-processors is preferred while the data themselves are stored in the cloud. As many sources continuously generate data in real time, analytics must often be performed “on-the-fly” and without an opportunity to revisit past entries. Due to their disparate origins, massive datasets are noisy, incomplete, prone to outliers, and vulnerable to cyber-attacks. These effects are amplified if the acquisition and transportation cost per datum is driven to a minimum. Overall, Big Data present challenges in which resources such as time, space, and energy, are intertwined in complex ways with data resources. Given these challenges, ample signal processing opportunities arise. This tutorial lecture outlines ongoing research in novel models applicable to a wide range of Big Data analytics problems, as well as algorithms to handle the practical challenges, while revealing fundamental limits and insights on the mathematical trade-offs involved.
机译:摘要仅来自给定。我们生活在数据泛滥的时代。普及型传感器收集了我们生活中每一个环节的大量信息,从而产生了各种格式的大量原始数据流。从前所未有的数据量中挖掘信息有望限制流行病和疾病的传播,识别金融市场的趋势,了解新兴的社会计算系统的动态,并保护关键的基础设施,包括智能电网和Internet的骨干网。虽然可以肯定地认为大数据是一大福气,但大规模数据集也带来了巨大挑战。庞大的数据量使得通常无法使用中央处理器和存储来运行分析,并且当数据本身存储在云中时,最好使用并行化的多处理器进行分布式处理。由于许多来源连续不断地实时生成数据,因此分析通常必须“即时”执行,并且没有机会重新访问过去的条目。由于它们的来源不同,因此海量数据集嘈杂,不完整,容易出现异常值,并且容易受到网络攻击。如果将每个数据的采集和运输成本降至最低,这些影响就会放大。总体而言,大数据带来了挑战,其中诸如时间,空间和能源之类的资源以复杂的方式与数据资源交织在一起。面对这些挑战,出现了充足的信号处理机会。本教程讲座概述了适用于各种大数据分析问题的新颖模型的正在进行的研究,以及应对实际挑战的算法,同时揭示了所涉及的数学折衷的基本限制和见解。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号