...
首页> 外文期刊>Big Data Analytics >A comprehensive model for management and validation of federal big data analytical systems
【24h】

A comprehensive model for management and validation of federal big data analytical systems

机译:联邦大数据分析系统管理和验证的综合模型

获取原文
           

摘要

BackgroundIn this era of data science, many software vendors are rushing towards providing better solutions for data management, analytics, validation and security. The government, being one of the most important customers, is riding the wave of data and business intelligence. However, federal agencies have certain requirements and bureaucracies for data-related processes, certain rules and specific regulations that would entail special models for building and managing data analytical systems. In this paper, and based on work done at the US government, a model for data management and validation is introduced: Federal Model for Data Management and Validation (FedDMV). FedDMV is 4-step model that has a set of best practices, databases, software tools and analytics. Automated procedures are used to develop the system and maintain it, and association rules are used for improving its quality. ResultsAfter working with multiple engineers and analysts at the federal agency, there is a general consent that FedDMV is easy to follow (please refer to the experimental survey). However, to quantify that satisfaction, three experimental studies were performed. One is a comparison to other state-of-the-art development models at the government, the second one is a survey that was collected at the government to quantify the level of satisfaction regarding FedDMV and its tool; and finally, a data validation study was performed through detailed testing of the federal system (using an Association Rules algorithm). ConclusionsTo develop a safe and sound federal data analytical system, a tested and rigorous model is required. There is a lack of government-specific models in industry and research. FedDMV aims to provide solutions and guided steps to facilitate the development of data analytics systems given the governmental constraints. FedDMV deals with unstructured data that streams from multiple sources, automates steps that are usually manual, validates the data and maximizes its security. The results of the experimental work are recorded and reported in this manuscript.
机译:背景技术在当今的数据科学时代,许多软件供应商都在努力为数据管理,分析,验证和安全性提供更好的解决方案。作为最重要的客户之一,政府正乘着数据和商业智能的浪潮。但是,联邦机构对与数据相关的流程,某些规则和特定法规有一定的要求和官僚机构,这将需要特殊的模型来构建和管理数据分析系统。本文基于美国政府所做的工作,介绍了一种数据管理和验证模型:联邦数据管理和验证模型(FedDMV)。 FedDMV是一个四步模型,具有一组最佳实践,数据库,软件工具和分析。自动化程序用于开发和维护系统,而关联规则则用于提高其质量。结果与联邦机构的多个工程师和分析师合作后,人们普遍同意FedDMV易于遵循(请参考实验调查)。但是,为了量化满意度,我们进行了三项实验研究。第一个是与政府的其他最新发展模型进行比较,第二个是政府收集的一项调查,用于量化对FedDMV及其工具的满意度。最后,通过对联邦系统进行详细测试(使用关联规则算法)进行了数据验证研究。结论为了开发安全可靠的联邦数据分析系统,需要经过测试且严格的模型。在行业和研究中缺乏政府特定的模型。由于政府的限制,FedDMV旨在提供解决方案和指导步骤,以促进数据分析系统的开发。 FedDMV处理来自多个源的非结构化数据,自动执行通常是手动的步骤,验证数据并最大化其安全性。实验工作的结果记录在本手稿中并报告。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号