首页> 外文会议>Conference on survey and other telescope technologies and discoveries >Petabyte Scale Data Mining: Dream or Reality?

【24h】

Petabyte Scale Data Mining: Dream or Reality?

机译：Petabyte Scale数据挖掘：梦想或现实？

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Science is becoming very data intensive. Today's astronomy datasets with tens of millions of galaxies already present substantial challenges for data mining. In less than 10 years the catalogs are expected to grow to billions of objects, and image archives will reach Petabytes. Imagine having a 100GB database in 1996, when disk scanning speeds were 30MB/s, and database tools were immature. Such a task today is trivial, almost manageable with a laptop. We think that the issue of a PB database will be very similar in six years. In this paper we scale our current experiments in data archiving and analysis on the Sloan Digital Sky Survey data six years into the future. We analyze these projections and look at the requirements of performing data mining on such data sets. We conclude that the task scales rather well: we could do the job today, although it would be expensive. There do not seem to be any show-stoppers that would prevent us from storing and using a Petabyte dataset six years from today.

机译：科学正变得非常密集。今天的天文数据集具有数以十万个星系已经为数据挖掘带来了大量挑战。在不到10年的时间内，目录预计将增长到数十亿个对象，而图像档案将达到PETABYTES。想象一下1996年拥有100GB的数据库，当磁盘扫描速度为30MB / s时，数据库工具不成熟。今天的这样一项任务是微不足道的，几乎可以使用笔记本电脑。我们认为PB数据库的问题在六年内将非常相似。在本文中，我们将目前的实验扩展了六年的斯隆数字天空调查数据的数据归档和分析中。我们分析了这些预测，并查看在此类数据集上执行数据挖掘的要求。我们得出结论，任务相当稳定：我们今天可以做这项工作，虽然它会很贵。似乎没有任何展示者，可以阻止我们在今天六年来储存和使用Petabyte DataSet。

著录项

来源
《Conference on survey and other telescope technologies and discoveries 》|2002年||共6页
会议地点
作者
Alexander S. Szalay; Jim Gray; Jan Vandenberg;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类天文仪器 ;
关键词
data mining; large-scale computing; databases; spatial statistics;

机译：数据挖掘;大规模计算;数据库;空间统计;

相似文献

外文文献
中文文献
专利

1. Augmented Virtual Reality: Combining Crowd Sensing and Social Data Mining with Large-Scale Simulation Using Mobile Agents for Future Smart Cities [J] . Stefan Bosse, Uwe Engel Proceedings . 2018 ,第1期

机译：增强虚拟现实：将人群传感和社会数据挖掘与使用移动代理商进行大规模仿真，以便将来的智能城市使用移动代理
2. Remote sensing heritage in a petabyte-scale: satellite data and heritage Earth Engine (c) applications [J] . Agapiou Athos International journal of digital Earth . 2017 ,第1a3期

机译：Petabyte-Scale的遥感遗产：卫星数据和遗产地球发动机（C）应用
3. Spectroscopic data handling at petabyte scale [J] . Antony N. Davies, Shane R. Ellis, Benjamin Balluff, Spectroscopy Asia . 2016 ,第2期

机译：PB级光谱数据处理
4. Petabyte Scale Data Mining: Dream or Reality? [C] . Alexander S. Szalay, Jim Gray, Jan Vandenberg Conference on Survey and Other Telescope Technologies and Discoveries; Aug 27-28, 2002; Waikoloa, Hawaii, USA . 2002

机译：PB级数据挖掘：梦想还是现实？
5. Classification of Driver Daydreaming Using Data Mining Techniques. [D] . Miao, Luda. 2012

机译：使用数据挖掘技术对驾驶员做白日梦的分类。
6. Large-Scale Data Mining of Rapid Residue Detection Assay Data From HTML and PDF Documents: Improving Data Access and Visualization for Veterinarians [O] . Majid Jaberi-Douraki, Soudabeh Taghian Dinani, Nuwan Indika Millagaha Gedara, 2021

机译：来自HTML和PDF文件的快速残留检测测定数据的大规模数据挖掘：改善兽医的数据访问和可视化
7. Petabyte Scale Data Mining: Dream or Reality? [O] . Szalay, Alexander S., Gray, Jim, vandenBerg, Jan 2002

机译：petabyte scale Data mining：梦想还是现实？

Petabyte Scale Data Mining: Dream or Reality?

摘要

著录项

相似文献

相关主题

期刊订阅