Towards Development of Clustering Applications for Large-Scale Comparative Genotyping and Kinship Analysis Using Y-Short Tandem Repeats

机译：面向使用Y短串联重复序列进行大规模比较基因分型和亲缘关系分析的聚类应用程序的开发

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Y-chromosome short tandem repeats (Y-STRs) are genetic markers with practical applications in human identification. However, where mass identification is required (e.g., in the aftermath of disasters with significant fatalities), the efficiency of the process could be improved with new statistical approaches. Clustering applications are relatively new tools for large-scale comparative genotyping, and the k-Approximate Modal Haplotype (k-AMH), an efficient algorithm for clustering large-scale Y-STR data, represents a promising method for developing these tools. In this study we improved the k-AMH and produced three new algorithms: the Nk-AMH I (including a new initial cluster center selection), the Nk-AMH II (including a new dominant weighting value), and the Nk-AMH III (combining I and II). The Nk-AMH III was the superior algorithm, with mean clustering accuracy that increased in four out of six datasets and remained at 100% in the other two. Additionally, the Nk-AMH III achieved a 2% higher overall mean clustering accuracy score than the k-AMH, as well as optimal accuracy for all datasets (0.84–1.00). With inclusion of the two new methods, the Nk-AMH III produced an optimal solution for clustering Y-STR data; thus, the algorithm has potential for further development towards fully automatic clustering of any large-scale genotypic data.

机译：Y染色体短串联重复序列（Y-STR）是遗传标记，在人类识别中具有实际应用。但是，在需要进行大规模识别的情况下（例如，在重大伤亡灾难之后），可以通过新的统计方法来提高流程的效率。聚类应用是用于大规模比较基因分型的相对较新的工具，k-近似模态单倍型（k-AMH）是一种用于对大型Y-STR数据进行聚类的有效算法，代表了开发这些工具的一种有前途的方法。在这项研究中，我们改进了k-AMH并产生了三种新算法：Nk-AMH I（包括新的初始聚类中心选择），Nk-AMH II（包括新的主要加权值）和Nk-AMH III （结合I和II）。 Nk-AMH III是更好的算法，平均聚类准确性在六个数据集中的四个中增加，而在其他两个数据集中则保持在100％。此外，Nk-AMH III的总体平均聚类准确度得分比k-AMH高2％，并且所有数据集的最佳准确度（0.84–1.00）。结合了这两种新方法，Nk-AMH III为聚类Y-STR数据提供了最佳解决方案。因此，该算法有潜力进一步发展为任何大规模基因型数据的全自动聚类。

著录项

期刊名称 OMICS : a Journal of Integrative Biology
作者
Ali Seman; Azizian Mohd Sapawi; Mohd Zaki Salleh;
展开▼
作者单位

展开▼
年(卷),期 -1(19),6
年度 -1
页码 361–367
总页数 7
原文格式 PDF
正文语种
中图分类遗传学;应用微生物学;
关键词

相似文献

外文文献
中文文献
专利

1. First Y-Short Tandem Repeat Categorical Dataset for Clustering Applications [J] . AliSeman, ZainabAbu Bakar, Mohamed NizamIsa Dataset Papers in Science . 2013,第1期

机译：用于聚类应用程序的第一个Y短串联重复分类数据集
2. Development of multiple-locus variable-number tandem-repeat analysis for rapid genotyping of Ehrlichia ruminantium and its application to infected Amblyomma variegatum collected in heartwater endemic areas in Uganda. [J] . Nakao R, Morrison LJ, Zhou L, Parasitology . 2012,第1期

机译：多位点可变数目串联重复分析技术在反刍动物埃里希氏菌快速基因分型中的应用及其在乌干达心水地方病地区收集的被感染的植物盲虫的应用。
3. Development of genome-wide informative simple sequence repeat markers for large-scale genotyping applications in chickpea and development of web resource [J] . Swarup K. Parida, Mohit Verma, Santosh K. Yadav, Frontiers in Plant Science . 2015,第1期

机译：鹰嘴豆中大规模基因分型应用的全基因组信息简单序列重复标记的开发和网络资源的开发
4. Hard and soft updating centroids for clustering Y-short tandem repeats (Y-STR) data [C] . Seman A., Bakar Z.A., Daud N. 2010 IEEE Conference on Open Systems . 2010

机译：硬更新和软更新质心，用于聚类Y-短串联重复（Y-STR）数据
5. Applications of variable number tandem repeat genotyping in the validation of an animal medical model and gene flow studies in threatened populations of reptiles. [D] . Smith, Candace D. 2009

机译：可变数目串联重复基因分型在动物医学模型验证和爬行动物濒危种群的基因流研究中的应用。
6. Development of genome-wide informative simple sequence repeat markers for large-scale genotyping applications in chickpea and development of web resource [O] . Swarup K. Parida, Mohit Verma, Santosh K. Yadav, 2015

机译：鹰嘴豆大规模基因分型应用的全基因组信息简单序列重复标记的开发和网络资源的开发
7. Towards Development of Clustering Applications for Large-Scale Comparative Genotyping and Kinship Analysis Using Y-Short Tandem Repeats [O] . Ali Seman, Azizian Mohd Sapawi, Mohd Zaki Salleh 2015

机译：利用Y短串联重复发展大规模比较基因分型和亲属性分析的聚类应用的发展
8. Tandem Repeat Regions within the Burkholderia pseudomallei Genome and their Application for High-Resolution Genotyping [R] . U'Ren, J. M. , Schupp, J. M. , Pearson, T. , 2007

机译：Burkholderia pseudomallei基因组内的串联重复区及其在高分辨率基因分型中的应用

Towards Development of Clustering Applications for Large-Scale Comparative Genotyping and Kinship Analysis Using Y-Short Tandem Repeats

摘要

著录项

相似文献

相关主题

期刊订阅