Intra-Cluster Distance Minimization in DNA Methylation Analysis Using an Advanced Tabu-Based Iterative k-Medoids Clustering Algorithm (T-CLUST)

Damgacioglu Haluk; Celik Emrah; Celik Nurcin


Abstract

Recent advances in DNA methylation profiling have paved the way for understanding the epigenetic mechanisms underlying various diseases such as cancer. Conventional distance-based clustering algorithms (e.g., hierarchical and k-means clustering) have been heavily used in such profiling because they are fast enough for high-throughput analysis, but they commonly converge to suboptimal solutions and/or trivial clusters because of their greedy search nature. Hence, methodologies are needed to improve the quality of the clusters formed by these algorithms without sacrificing their speed. In this study, we introduce three related algorithms for a complete high-throughput methylation analysis: a variance-based dimension reduction algorithm to handle the high dimensionality of the data, an outlier detection algorithm to identify outliers in the data, and an advanced Tabu-based iterative k-medoids clustering algorithm (T-CLUST) to reduce the impact of initial solutions on the performance of the conventional k-medoids algorithm. The performance of the proposed algorithms is demonstrated on nine different real DNA methylation datasets obtained from the Gene Expression Omnibus DataSets database. The accuracy of the cluster identification obtained by our proposed algorithms is higher than that of hierarchical and k-means clustering, as well as the conventional methods. The algorithms are implemented in MATLAB and are available at: http://www.coe.miami.edu/simlab/tclust.html.
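The abstract names the building blocks but gives no algorithmic detail, so the following is only a minimal Python sketch of the general idea, not the authors' T-CLUST or their MATLAB code. It assumes a simple variance ranking for dimension reduction, plain alternating k-medoids, and a short tabu list that skips recently tried initial medoid sets across random restarts. All function names and parameters (`variance_filter`, `tabu_restart_k_medoids`, `restarts`, `tabu_size`) are hypothetical, and the outlier detection step is omitted because the abstract does not describe it.

```python
import random


def variance_filter(rows, keep):
    """Keep the `keep` columns with the largest variance (illustrative only)."""
    n, dims = len(rows), len(rows[0])
    variances = []
    for j in range(dims):
        column = [row[j] for row in rows]
        mean = sum(column) / n
        variances.append(sum((v - mean) ** 2 for v in column) / n)
    ranked = sorted(range(dims), key=lambda j: variances[j], reverse=True)
    top = sorted(ranked[:keep])  # preserve the original column order
    return [[row[j] for j in top] for row in rows]


def distance_matrix(rows):
    """Pairwise Euclidean distances between samples."""
    return [[sum((a - b) ** 2 for a, b in zip(p, q)) ** 0.5 for q in rows]
            for p in rows]


def intra_cluster_cost(dist, medoids):
    """Total distance from every sample to its nearest medoid."""
    return sum(min(row[m] for m in medoids) for row in dist)


def k_medoids(dist, init):
    """Plain alternating k-medoids; stops when the cost no longer decreases."""
    medoids = sorted(init)
    cost = intra_cluster_cost(dist, medoids)
    while True:
        clusters = {m: [] for m in medoids}
        for i in range(len(dist)):
            # A medoid always stays in its own cluster, so no cluster is empty.
            nearest = i if i in clusters else min(medoids, key=lambda m: dist[i][m])
            clusters[nearest].append(i)
        # New medoid of each cluster: the member closest to all other members.
        candidate = sorted(
            min(members, key=lambda c: sum(dist[c][p] for p in members))
            for members in clusters.values()
        )
        candidate_cost = intra_cluster_cost(dist, candidate)
        if candidate_cost >= cost:
            return medoids, cost
        medoids, cost = candidate, candidate_cost


def tabu_restart_k_medoids(dist, k, restarts=20, tabu_size=5, seed=0):
    """Restart k-medoids from random initial sets, skipping tabu (recent) ones."""
    rng = random.Random(seed)
    tabu = []  # assumption: the tabu list stores recent initial medoid sets
    best_medoids, best_cost = None, float("inf")
    for _ in range(restarts):
        init = tuple(sorted(rng.sample(range(len(dist)), k)))
        if init in tabu:
            continue
        tabu.append(init)
        if len(tabu) > tabu_size:
            tabu.pop(0)
        medoids, cost = k_medoids(dist, init)
        if cost < best_cost:
            best_medoids, best_cost = medoids, cost
    return best_medoids, best_cost


if __name__ == "__main__":
    # Toy data: the middle column is constant and carries no information.
    raw = [[0, 7, 0], [0, 7, 1], [1, 7, 0], [10, 7, 10], [10, 7, 11], [11, 7, 10]]
    reduced = variance_filter(raw, keep=2)
    print(tabu_restart_k_medoids(distance_matrix(reduced), k=2))
```

Keeping the best result over several starting points is one simple way to lessen the dependence on the initial medoids that the abstract mentions; the published algorithm may use its tabu memory quite differently.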


Bibliographic details

Source
IEEE/ACM Transactions on Computational Biology and Bioinformatics | 2020, No. 4 | pp. 1241-1252 | 12 pages
Authors
Damgacioglu Haluk; Celik Emrah; Celik Nurcin
Author affiliations

Univ Miami Dept Ind Engn Coral Gables FL 33146 USA;

Univ Miami Dept Mech & Aerosp Coral Gables FL 33146 USA;

Univ Miami Dept Ind Engn Coral Gables FL 33146 USA;

Original format: PDF
Language: English
Keywords
Biomarker identification; clustering; DNA methylation analysis; k-medoids clustering; outlier detection;


Similar literature

1. Genome-wide DNA methylation analysis of articular chondrocytes reveals a cluster of osteoarthritic patients - Genome-wide DNA methylation study of hip and knee cartilage reveals embryonic organ and skeletal system morphogenesis as major pathways involved in osteoarthritis - DNA Methylation in Osteoarthritis [J]. Agro Food Industry Hi-Tech. 2018, No. 1
2. Inter- and intra-cluster movement of mobile sink algorithms for cluster-based networks to enhance the network lifetime [J]. Gharaei Niayesh, Abu Bakar Kamalrulnizam, Hashim Siti Zaiton Mohd. Ad hoc networks. 2019, No. MARa
3. An improved iterated greedy algorithm with a Tabu-based reconstruction strategy for the no-wait flowshop scheduling problem [J]. Ding Jian-Ya, Song Shiji, Gupta Jatinder N. D. Applied Soft Computing. 2015
4. IK-BKM: An incremental clustering approach based on intra-cluster distance [C]. Ben Hariz Sarra, Elouedi Zied. 2010 IEEE/ACS International Conference on Computer Systems and Applications. 2010
5. Characterization of DNA methylation patterns in tumorigenesis and the use of DNA methylation analysis to gauge toxic potential [D]. Watson, Rebecca Erin. 2004
6. The role of cluster size and intra-cluster correlations when adjusting for covariates in the analysis of cluster randomised trials [O]. Neil Wright. 2015
7. An Improved K-modes Clustering Algorithm Based on Intra-cluster and Inter-cluster Dissimilarity Measure [O]. Hongfang Zhou, Yihui Zhang, Yibin Liu. 2017
