Statistical Modeling of Large-Scale Simulation Data


Abstract

With the advent of fast computer systems, scientists are now able to generate terabytes of simulation data. Unfortunately, the sheer size of these data sets has made efficient exploration of them impossible. To aid scientists in gathering knowledge from their simulation data, we have developed an ad-hoc query infrastructure. Our system, called AQSim (short for Ad-hoc Queries for Simulation), reduces the data storage requirements and access times in two stages. First, it creates and stores mathematical and statistical models of the data. Second, it evaluates queries on the models of the data instead of on the entire data set. In this paper, we present two simple but highly effective statistical modeling techniques for simulation data. Our first modeling technique computes the true mean of systematic partitions of the data. It makes no assumptions about the distribution of the data and uses a variant of the root mean square error to evaluate a model. In our second statistical modeling technique, we use the Anderson-Darling goodness-of-fit method on systematic partitions of the data. This second method evaluates a model by how well it passes the normality test on the data. Both of our statistical models summarize the data so as to answer range queries in the most effective way. We calculate precision on an answer to a query by scaling the one-sided Chebyshev inequalities with the original mesh's topology. Our experimental evaluations on two scientific simulation data sets illustrate the value of using these statistical modeling techniques on large simulation data sets.
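The first technique described in the abstract (partition means evaluated by a root mean square error) and the one-sided Chebyshev bound can be illustrated with a minimal Python sketch. It is not the AQSim implementation. Several things here are assumptions made only for illustration: the data is a flat 1-D list, "systematic partitions" are produced by recursive bisection, the plain RMSE stands in for the paper's unspecified variant, and the names `build_mean_model`, `max_rmse`, and `cantelli_upper_tail` are hypothetical. The scaling by mesh topology mentioned in the abstract is not reproduced.

```python
import math
from statistics import fmean


def rmse_about_mean(values):
    """RMSE incurred when every value is represented by the partition mean."""
    mu = fmean(values)
    return math.sqrt(fmean((v - mu) ** 2 for v in values))


def build_mean_model(values, max_rmse, start=0):
    """Recursively bisect `values` until each partition's RMSE is <= max_rmse.

    Returns a list of (start, stop, mean, rmse, count) tuples.
    The bisection scheme and tuple layout are assumptions, not the paper's.
    """
    err = rmse_about_mean(values)
    if err <= max_rmse or len(values) == 1:
        return [(start, start + len(values), fmean(values), err, len(values))]
    mid = len(values) // 2
    return (build_mean_model(values[:mid], max_rmse, start)
            + build_mean_model(values[mid:], max_rmse, start + mid))


def cantelli_upper_tail(std, k):
    """One-sided Chebyshev (Cantelli) bound: P(X - mean >= k) <= std^2 / (std^2 + k^2)."""
    if k <= 0:
        raise ValueError("k must be positive")
    return std ** 2 / (std ** 2 + k ** 2)


if __name__ == "__main__":
    data = [1.0, 1.0, 1.0, 1.0, 5.0, 5.0, 9.0, 9.0]
    for part in build_mean_model(data, max_rmse=0.5):
        print(part)
    print(cantelli_upper_tail(std=1.0, k=3.0))
```

A range query could then be answered from the stored tuples alone, for example by reporting every partition whose mean falls in the queried range, with the Cantelli bound giving a distribution-free limit on how often individual values in that partition lie more than `k` above the stored mean.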

Bibliographic record

Authors
Eliassi-Rad, T.; Critchlow, T.; Abdulla, G.;
Year: 2002
Pages: 1-14
Total pages: 14
Original format: PDF
Language: English
Chinese Library Classification: Industrial technology
Keywords
Statistical models; Data sets; Data storage; Algorithms;

Similar literature

1. Statistical modelling and direct numerical simulations of decaying stably stratified turbulence. Part 2. Large-scale and small-scale anisotropy [J]. Godeferd FS., Staquet C. Journal of Fluid Mechanics. 2003, Issue 0
2. The statistical analysis of multivariate failure time data: A marginal modeling approach, Ross L. Prentice, Shanshan Zhao, Boca Raton, FL: CRC Press [J]. Lin D. Y. Biometrics: Journal of the Biometric Society: An International Society Devoted to the Mathematical and Statistical Aspects of Biology. 2019, Issue 4
3. Comparison of statistical and machine learning models for healthcare cost data: a simulation study motivated by Oncology Care Model (OCM) data [J]. Madhu Mazumdar, Jung-Yi Joyce Lin, Wei Zhang. BMC Health Services Research. 2020, Issue 1
4. Statistical modeling of large-scale simulation data [C]. Tina Eliassi-Rad, Terence Critchlow, Ghaleb Abdulla. Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2002). 2002
5. An Exploration of Statistical Modelling Methods on Simulation Data Case Study: Biomechanical Predator-Prey Simulations [D] . Seto, Christian. 2018
6. Comparison of statistical and machine learning models for healthcare cost data: a simulation study motivated by Oncology Care Model (OCM) data [O] . Madhu Mazumdar, Jung-Yi Joyce Lin, Wei Zhang, 2020
7. Statistical Modeling of Large-Scale Simulation Data [O]. Tina Eliassi-Rad, Terence Critchlow. 2002