
Using Hamming Distance as Information for SNP-Sets Clustering and Testing in Disease Association Studies

Abstract

The availability of high-throughput genomic data has led to several challenges in recent genetic association studies, including the large number of genetic variants that must be considered and the computational complexity of the statistical analyses. Tackling these problems with a marker-set study such as SNP-set analysis can be an efficient solution. To construct SNP-sets, we first propose a clustering algorithm, which employs Hamming distance to measure the similarity between strings of SNP genotypes and evaluates whether the given SNPs or SNP-sets should be clustered. A dendrogram can then be constructed based on this distance measure, and the number of clusters can be determined. With the resulting SNP-sets, we next develop an association test, HDAT, to examine susceptibility to the disease of interest. The proposed test assesses, based on Hamming distance, whether the similarity between a diseased and a normal individual differs from the similarity between two individuals of the same disease status. The proposed methodology needs only genotype information. No inference of haplotypes is required, and the SNPs under consideration need not be located in nearby regions. The proposed clustering algorithm and association test are illustrated with applications and simulation studies. Compared with other existing methods, the clustering algorithm is faster and better at identifying sets containing SNPs that exert a similar effect. In addition, the simulation studies demonstrated that the proposed test works well for SNP-sets containing a large proportion of neutral SNPs. Furthermore, applying the clustering algorithm before testing a large data set helps narrow down the genetic regions that contain susceptibility markers.
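The two ideas in the abstract can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation. The genotype coding (0/1/2 as characters), the average-linkage rule, the SNP names, and the permutation-based comparison are all assumptions made for the example. In particular, the permutation test below is a generic stand-in for comparing between-status and within-status Hamming distances; the actual HDAT statistic is not specified in the abstract.

```python
import random
from itertools import combinations


def hamming(a, b):
    """Number of positions at which two equal-length genotype strings differ."""
    if len(a) != len(b):
        raise ValueError("genotype strings must have equal length")
    return sum(x != y for x, y in zip(a, b))


def cluster_snps(snps, n_clusters):
    """Agglomerative clustering of SNPs using Hamming distance.

    snps maps a (hypothetical) SNP name to its genotype string across
    individuals. Average linkage is an assumption, not the paper's rule.
    """
    clusters = [[name] for name in snps]

    def linkage(c1, c2):
        total = sum(hamming(snps[a], snps[b]) for a in c1 for b in c2)
        return total / (len(c1) * len(c2))

    while len(clusters) > n_clusters:
        # Merge the closest pair of clusters; i < j, so deleting j is safe.
        i, j = min(combinations(range(len(clusters)), 2),
                   key=lambda p: linkage(clusters[p[0]], clusters[p[1]]))
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return clusters


def between_minus_within(genotypes, status):
    """Mean case-control distance minus mean same-status distance.

    Assumes both statuses occur and at least one same-status pair exists.
    """
    between, within = [], []
    for i, j in combinations(range(len(genotypes)), 2):
        d = hamming(genotypes[i], genotypes[j])
        (within if status[i] == status[j] else between).append(d)
    return sum(between) / len(between) - sum(within) / len(within)


def permutation_p_value(genotypes, status, n_perm=1000, seed=0):
    """Two-sided permutation p-value for the difference above (a stand-in test)."""
    rng = random.Random(seed)
    observed = abs(between_minus_within(genotypes, status))
    labels = list(status)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(labels)
        if abs(between_minus_within(genotypes, labels)) >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)


# Toy data: four SNPs genotyped on four individuals (made-up values).
toy_snps = {"snpA": "0012", "snpB": "0012", "snpC": "2100", "snpD": "2110"}
snp_sets = cluster_snps(toy_snps, n_clusters=2)
```

In this toy example, `snpA` and `snpB` have identical genotype strings and merge first, while `snpC` and `snpD` differ in one individual and form the second set. For the test, each individual is represented by its genotype string over the SNPs in one set, and `status` holds the case/control labels.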

Bibliographic details

Journal: other
Authors: Charlotte Wang; Wen-Hsin Kao; Chuhsing Kate Hsiao
Volume (issue): 10 (8)
Pages: e0135918
Total pages: 24
Original format: PDF

Similar literature

1. Adaptive SNP-Set Association Testing in Generalized Linear Mixed Models with Application to Family Studies [J]. Park Jun Young, Wu Chong, Basu Saonli. Behavior Genetics: An International Journal Devoted to Research in the Inheritance of Behavior in Animals and Man. 2018, No. 1
2. The Generalized Higher Criticism for Testing SNP-Set Effects in Genetic Association Studies [J]. Barnett Ian, Mukherjee Rajarshi, Lin Xihong. Journal of the American Statistical Association. 2017, No. 517
3. 4 Set-Based Gene × Environment Interaction Tests for Complex Diseases with Application to Genome-Wide Association and Sequencing Studies [C]. Shuo Jiao. Symposium of the University of Georgia's Center for Contextual Genetics and Prevention Science. 2016
4. SNP-set Tests for Sequencing and Genome-Wide Association Studies [D]. Barnett, Ian James. 2014
5. The Generalized Higher Criticism for Testing SNP-Set Effects in Genetic Association Studies [O]. Ian Barnett, Rajarshi Mukherjee, Xihong Lin
6. Using Hamming Distance as Information for SNP-Sets Clustering and Testing in Disease Association Studies [O]. Charlotte Wang, Wen-Hsin Kao, Chuhsing Kate Hsiao. 2015
