首页> 外文会议>Asian conference on intelligent information and database systems >Measuring Data Completeness for Microbial Genomics Database
【24h】

Measuring Data Completeness for Microbial Genomics Database

机译:测量微生物基因组数据库的数据完整性

获取原文

摘要

Poor quality data such as data with missing values (or records) cause negative consequences in many application domains. An important aspect of data quality is completeness. One problem in data completeness is the problem of missing individuals in data sets. Within a data set, the individuals refer to the real world entities whose information is recorded. So far, in completeness studies however, there has been little discussion about how missing individuals are assessed. In this paper, we propose the notion of population-based completeness (PBC) that deals with the missing individuals problem, with the aim of investigating what is required to measure PBC and to identify what is needed to support PBC measurements in practice. This paper explores the need of PBC in the microbial genomics where real sample data sets retrieved from a microbial database called Comprehensive Microbial Resources are used (CMR).
机译:质量低劣的数据(例如缺少值(或记录)的数据)会在许多应用程序域中造成负面影响。数据质量的一个重要方面是完整性。数据完整性中的一个问题是数据集中缺少个人的问题。在数据集中,个人指的是记录了信息的现实世界实体。到目前为止,在完整性研究中,关于如何评估失踪人员的讨论很少。在本文中,我们提出了基于人口的完整性(PBC)的概念,该概念处理失踪的个人问题,目的是调查测量PBC所需的条件,并确定在实践中支持PBC测量所需的条件。本文探讨了微生物基因组学中PBC的需求,其中使用了从称为综合微生物资源(CMR)的微生物数据库中检索到的真实样本数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号