Journal: Molecular Ecology Notes

Analysis of multilocus fingerprinting data sets containing missing data


Abstract

Missing data are commonly encountered using multilocus, fragment-based (dominant) fingerprinting methods, such as random amplified polymorphic DNA (RAPD) or amplified fragment length polymorphism (AFLP). Data sets containing missing data have been analysed by eliminating those bands or samples with missing data, assigning values to missing data or ignoring the problem. Here, we present a method that uses random assignments of band presence-absence to the missing data, implemented by the computer program FAMD (available from http://homepage.univie.ac.at/philipp.maria.schlueter/famd.html), for analyses based on pairwise similarity and Shannon's index. When missing values group in a data set, sample or band elimination is likely to be the most appropriate action. However, when missing values are scattered across the data set, minimum, maximum and average similarity coefficients are a simple means of visualizing the effects of missing data on tree structure. Our approach indicates the range of values that a data set containing missing data points might generate, and forces the investigator to consider the effects of missing values on data interpretation.
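
The random-assignment idea in the abstract can be illustrated with a small sketch. This is not FAMD's implementation, whose internals are not described here. It assumes a Jaccard coefficient as the pairwise similarity measure, an equal probability of presence and absence for each missing band, and `None` as the marker for a missing value. The function names `jaccard`, `fill_missing` and `similarity_range` are hypothetical.

```python
import random
from statistics import mean

MISSING = None  # assumed marker for an unscored band (not from the source)


def jaccard(a, b):
    """Jaccard similarity of two complete 0/1 band profiles."""
    shared = sum(1 for x, y in zip(a, b) if x == 1 and y == 1)
    either = sum(1 for x, y in zip(a, b) if x == 1 or y == 1)
    return shared / either if either else 0.0


def fill_missing(profile, rng, p_present=0.5):
    """Replace each missing entry with a random presence (1) or absence (0).

    p_present = 0.5 is an illustrative assumption, not a documented default.
    """
    return [
        (1 if rng.random() < p_present else 0) if x is MISSING else x
        for x in profile
    ]


def similarity_range(a, b, replicates=1000, seed=0):
    """Return (min, mean, max) Jaccard similarity over random fillings."""
    rng = random.Random(seed)
    values = [
        jaccard(fill_missing(a, rng), fill_missing(b, rng))
        for _ in range(replicates)
    ]
    return min(values), mean(values), max(values)


# Two made-up band profiles, each with one missing band.
sample_a = [1, 0, 1, MISSING, 1, 0]
sample_b = [1, 1, MISSING, 0, 1, 0]

low, avg, high = similarity_range(sample_a, sample_b)
print(f"min={low:.2f} mean={avg:.2f} max={high:.2f}")
```

The spread between the minimum and maximum shows how much a single pairwise similarity could shift depending on what the missing bands actually were. Applying the same procedure to every pair of samples would give minimum, average and maximum similarity matrices, from which separate trees could be built and compared.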
