Asian Conference on Intelligent Information and Database Systems

Measuring Data Completeness for Microbial Genomics Database



Abstract

Poor-quality data, such as data with missing values or records, has negative consequences in many application domains. Completeness is an important aspect of data quality, and one completeness problem is that of individuals missing from a data set. Here, individuals are the real-world entities whose information the data set records. So far, however, completeness studies have said little about how missing individuals should be assessed. In this paper, we propose the notion of population-based completeness (PBC), which addresses the missing-individuals problem. Our aim is to investigate what is required to measure PBC and to identify what is needed to support PBC measurement in practice. We explore the need for PBC in microbial genomics using real sample data sets retrieved from a microbial database called Comprehensive Microbial Resources (CMR).
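The abstract does not give a formula for PBC. As a rough illustration only, one plausible reading is the fraction of a reference population's individuals that appear in the data set under assessment. The sketch below assumes that reading; the function name, the identifiers, and the sample values are hypothetical and are not taken from the paper or from CMR.

```python
def population_based_completeness(dataset_ids, reference_population_ids):
    """Return (ratio, missing) under an ASSUMED reading of PBC.

    ratio   -- share of the reference population found in the data set
    missing -- reference individuals absent from the data set
    This is an illustrative sketch, not the paper's definition.
    """
    reference = set(reference_population_ids)
    if not reference:
        raise ValueError("reference population must not be empty")
    # Duplicates and individuals outside the reference population
    # do not count towards completeness.
    present = set(dataset_ids) & reference
    missing = reference - present
    return len(present) / len(reference), missing


# Hypothetical identifiers, for illustration only.
reference_population = {"genome_A", "genome_B", "genome_C", "genome_D"}
dataset = ["genome_A", "genome_C", "genome_C", "genome_X"]

score, missing = population_based_completeness(dataset, reference_population)
print(score, sorted(missing))
```

Returning the missing individuals alongside the ratio shows which entities are absent as well as how many. The harder practical question, which the abstract raises, is where a trustworthy reference population comes from.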
