首页> 外文会议>Proceedings of the 77th ASISamp;T annual meeting, Connecting collections, cultures, and communities >Parameter tuning: Exposing the gap between data curation and effective data analytics
【24h】

Parameter tuning: Exposing the gap between data curation and effective data analytics

机译:参数调整:揭示数据管理与有效数据分析之间的差距

获取原文
获取原文并翻译 | 示例

摘要

The “big data” movement promises to deliver betterrndecisions in all aspects of our lives from business to sciencernhealth, and government by using computational techniquesrnto identify patterns from large historical collections of data.rnAlthough a unified view from curation to analysis has beenrnproposed, current research appears to have polarized intorntwo separate groups: those curating large datasets and thoserndeveloping computational methods to identify patterns inrnlarge datasets. The case study presented here demonstratesrnthe enormous impact that parameter tuning can have on thernresulting accuracy, precision, and recall of a computationalrnmodel that is generated from data. It also illustrates thernvastness of the parameter space that must be searched inrnorder to produce optimal models and curated in order tornavoid redundant experiments. This highlights the need forrnresearch that focuses on the gap between collection andrnanalytics if we are to realize the potential of big data.
机译:“大数据”运动有望通过使用计算技术从大型历史数据集中识别模式,从而从商业到科学,再到政府,为我们生活的各个方面做出更好的决策。尽管提出了从策展到分析的统一观点,但目前的研究似乎正在出现。分为两大类:策划大型数据集和开发计算方法以识别大型数据集的模式。本文介绍的案例研究表明,参数调整可能会对数据生成的计算模型的准确性,精度和召回率产生巨大影响。它还说明了参数空间的广度,必须对其进行无序搜索以生成最佳模型,并对其进行整理以避免重复实验。如果我们要认识到大数据的潜力,这突出了需要进行研究的重点,即集中于收集和分析之间的差距。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号