K-walks: clustering gene-expression data using a K-means clustering algorithm optimised by random walks

Yao Min; Wu Qinghua; Li Juan; Huang Tinghua

首页> 外文期刊>International journal of data mining and bioinformatics >K-walks: clustering gene-expression data using a K-means clustering algorithm optimised by random walks

【24h】

K-walks: clustering gene-expression data using a K-means clustering algorithm optimised by random walks

机译：K-walks：使用随机游走优化的K-means聚类算法对基因表达数据进行聚类

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Gene-expression data obtained from the biological experiments always have thousands of dimensions, which can be very confusing and perplexing to biologists when viewed as a whole. Clustering analysis is an explorative data-mining technique for statistical data analysis that is widely used in gene-expression data analysis. Practical approaches employed for solving the clustering problem use iterative procedures such as K-means, which typically converge to one of many local minima. Here, we propose a simulated annealing approximation algorithm that is optimised using random walks to solve the K-means clustering problem. The algorithm is verified with synthetic and real-world data sets and compared with other well-known K-means variants. The new algorithm is less sensitive to initial cluster centres, and the primary strength of our algorithm is its ability to produce high-quality clustering results for thousands of high-dimensional data. However, the algorithm is computationally intensive.

机译：从生物学实验中获得的基因表达数据总是具有成千上万的维度，从整体上看，这对于生物学家来说是非常困惑和困惑的。聚类分析是一种用于统计数据分析的探索性数据挖掘技术，广泛用于基因表达数据分析中。用于解决聚类问题的实用方法使用迭代过程，例如K-means，通常会收敛到许多局部极小值之一。在这里，我们提出了一种模拟退火近似算法，该算法使用随机游走进行了优化，以解决K均值聚类问题。该算法已通过合成和真实数据集进行了验证，并与其他众所周知的K-means变体进行了比较。新算法对初始聚类中心不那么敏感，我们算法的主要优势在于它能够为数千个高维数据生成高质量的聚类结果。但是，该算法计算量大。

著录项

来源
《International journal of data mining and bioinformatics》 |2016年第2期|共20页
作者
Yao Min; Wu Qinghua; Li Juan; Huang Tinghua;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
gene expression; K-means; random walks;

机译：基因表达;K-均值;随机游动;

相似文献

外文文献
中文文献
专利

1. K-walks: clustering gene-expression data using a K-means clustering algorithm optimised by random walks [J] . Yao Min, Wu Qinghua, Li Juan, SIAM journal on applied dynamical systems . 2017,第2期

机译：K-Walks：使用随机散步优化的K-means聚类算法聚类基因表达数据
2. Optimising Data Using K-Means Clustering Algorithm [J] . Mubeena Shaik, Naseema Shaik, Ahmed Unnisa Begum, International Journal of Engineering Research and Applications . 2015,第4期

机译：使用K均值聚类算法优化数据
3. Clustering data with the presence of attribute noise: a study of noise completely at random and ensemble of multiple k-means clusterings [J] . Iam-On Natthakan International journal of machine learning and cybernetics . 2020,第3期

机译：带有属性噪声的数据聚类：完全随机且多重k均值聚类的噪声研究
4. Random Centroid Selection for K-means Clustering: A Proposed Algorithm for Improving Clustering Results [C] . Arghyadeep Sen, Manjusha Pandey, Krishna Chakravarty International Conference on Computer Science, Engineering and Applications . 2020

机译：K均值聚类的随机质心选择：改善聚类结果的拟议算法
5. Clustering educational digital library usage data: Comparisons of latent class analysis and K-means algorithms [D] . Xu, Beijie 2011

机译：聚集教育数字图书馆使用数据：潜在类别分析和K-means算法的比较
6. Balancing effort and benefit of K-means clustering algorithms in Big Data realms [O] . Joaquín Pérez-Ortega, Nelva Nely Almanza-Ortega, David Romero 2012

机译：大数据领域中K均值聚类算法的平衡工作和收益
7. Clustering the Age Classified Preprocessed Automated Blood Cell Counter Data using K-Means First Distinct Element Selection and Random Selection Algorithms [O] . D. Minnie, S. Srinivasan 2014

机译：使用K均值优先单元选择和随机选择算法对年龄分类的预处理自动血细胞计数器数据进行聚类

K-walks: clustering gene-expression data using a K-means clustering algorithm optimised by random walks

摘要

著录项

相似文献

相关主题

期刊订阅