An improved algorithm for clustering gene expression data

Sanghamitra Bandyopadhyay; Anirban Mukhopadhyay; Ujjwal Maulik


Abstract

Motivation: Recent advancements in microarray technology allow simultaneous monitoring of the expression levels of a large number of genes over different time points. Clustering is an important tool for analyzing such microarray data, whose typical properties are inherent uncertainty, noise and imprecision. In this article, a two-stage clustering algorithm is proposed that employs a recently proposed variable string length genetic scheme and a multiobjective genetic clustering algorithm. It is based on the novel concept of points having significant membership to multiple classes. An iterated version of the well-known Fuzzy C-Means is also utilized for clustering.

Results: On a variety of artificial and publicly available real-life data sets, the proposed two-stage clustering algorithm is shown to be significantly superior to the average linkage method, Self Organizing Map (SOM) and a recently developed weighted Chinese restaurant-based clustering method (CRC), all widely used methods for clustering gene expression data. The biological relevance of the clustering solutions is also analyzed.
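The abstract gives no implementation details, so the following is only a minimal sketch of the two ingredients it names: textbook Fuzzy C-Means, and the idea of points with significant membership to more than one cluster. It is not the authors' two-stage algorithm, and it omits the genetic and multiobjective components entirely. The function names, the toy data, and the 0.3 membership threshold are assumptions chosen for illustration.

```python
import random


def fuzzy_c_means(points, n_clusters, m=2.0, n_iter=100, tol=1e-6, seed=0):
    """Textbook Fuzzy C-Means. Returns (centers, memberships)."""
    rng = random.Random(seed)
    n, dim = len(points), len(points[0])

    # Random initial memberships; each row is normalised to sum to 1.
    u = []
    for _ in range(n):
        row = [rng.random() + 1e-9 for _ in range(n_clusters)]
        total = sum(row)
        u.append([v / total for v in row])

    centers = []
    for _ in range(n_iter):
        # Center update: mean of the points weighted by u_ik ** m.
        centers = []
        for k in range(n_clusters):
            w = [u[i][k] ** m for i in range(n)]
            w_sum = sum(w)
            centers.append(
                [sum(w[i] * points[i][d] for i in range(n)) / w_sum for d in range(dim)]
            )

        # Membership update: u_ik is proportional to d_ik ** (-2 / (m - 1)).
        shift = 0.0
        for i in range(n):
            d2 = [sum((points[i][d] - c[d]) ** 2 for d in range(dim)) for c in centers]
            if min(d2) == 0.0:
                # Point coincides with a center: share membership among those centers.
                raw = [1.0 if x == 0.0 else 0.0 for x in d2]
            else:
                raw = [x ** (-1.0 / (m - 1.0)) for x in d2]
            total = sum(raw)
            new_row = [v / total for v in raw]
            shift = max(shift, max(abs(a - b) for a, b in zip(new_row, u[i])))
            u[i] = new_row

        if shift < tol:  # memberships have stopped changing
            break
    return centers, u


def ambiguous_points(memberships, threshold=0.3):
    """Indices of points whose second-largest membership is >= threshold.

    The threshold is a hypothetical choice, not a value from the article.
    """
    flagged = []
    for i, row in enumerate(memberships):
        top_two = sorted(row, reverse=True)[:2]
        if len(top_two) == 2 and top_two[1] >= threshold:
            flagged.append(i)
    return flagged


# Toy data (made up): two tight groups plus one point lying between them.
points = [
    (0.0, 0.0), (0.1, 0.0), (0.0, 0.1),
    (5.0, 5.0), (5.1, 5.0), (5.0, 5.1),
    (2.55, 2.55),
]
centers, memberships = fuzzy_c_means(points, 2)
flagged = ambiguous_points(memberships)
print(flagged)
```

In this toy run the in-between point is the one expected to be flagged, since its membership is split almost evenly between the two clusters. How such points are identified and treated in the article's two stages is described in the full text, not here.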


Bibliographic details

Source: Bioinformatics, 2007, No. 21, pp. 2859-2865 (7 pages)
Authors: Sanghamitra Bandyopadhyay; Anirban Mukhopadhyay; Ujjwal Maulik
Affiliation: Machine Intelligence Unit, Indian Statistical Institute, Kolkata-700108
Indexed in: Science Citation Index (SCI, USA); Chemical Abstracts (CA, USA)
Original format: PDF
Language: English
Classification (Chinese Library Classification): biological sciences; bioengineering (biotechnology)

Similar literature


1. Multi-stage filtering for improving confidence level and determining dominant clusters in clustering algorithms of gene expression data [J]. Kasim S., Deris S., Othman R.M. Computers in Biology and Medicine. 2013, No. 9
2. An Improved Biclustering Algorithm for Gene Expression Data [J]. Sheng-Hua Jin, Li Hua. The Open Cybernetics & Systemics Journal. 2017, No. 1
3. Clustering gene expression data analysis using an improved EM algorithm based on multivariate elliptical contoured mixture models [J]. Zhe Liu, Yu-qing Song, Cong-hua Xie. Optik: Zeitschrift fur Licht- und Elektronenoptik = Journal for Light-and Electronoptic. 2014, No. 21
4. Evidence Accumulation from some clustering algorithms to improve gene expression data classification [C]. Ranjita Das, Sriparna Saha. International Conference on Soft Computing and Machine Intelligence. 2016
5. Clustering algorithms for time series gene expression in microarray data [D]. Zhang, Guilin. 2012
6. Genetic Algorithms Applied to Multi-Class Clustering for Gene Expression Data [O]. Haiyan Pan, Jun Zhu, Danfu Han. 2003
7. IMPROVED BICLUSTERING ALGORITHM FOR GENE EXPRESSION DATA [O]. 2013
