Biclustering using Parallel Fuzzy Approach for Analysis of Microarray Gene Expression Data

Dwitiya Tyagi-Tiwari; Manoj Jha; Namita Srivastava; Sujoy Das



Abstract

Biclustering is needed to analyze the expression patterns of genes by comparing rows of the gene expression matrix, and the expression profiles of samples by comparing its columns. Biclustering therefore requires clustering both genes and samples. The algorithm presented in this paper is based on the two-way clustering approach, in which genes and samples are clustered with a parallel fuzzy C-means algorithm implemented over a message passing interface; we call it MFCM. MFCM clusters the genes and the samples so as to maximize the membership function values of the data set. It is a parallelized rework of a parallel fuzzy two-way clustering algorithm for microarray gene expression data [9], undertaken to study the efficiency and parallelization improvement of the algorithm. The algorithm uses a gene entropy measure to filter the clustered data and find biclusters, and the method is able to obtain highly correlated biclusters of the gene expression data set. We have implemented the fuzzy c-means algorithm on the MATLAB parallel computing platform using MATLABMPI (Message Passing Version of MATLAB), and this approach is used to find biclusters of gene expression matrices. The biclustering step is also parallelized: an entropy filter function reduces the set of gene cluster centers by choosing the centers with minimum entropy. The algorithm is tested on the well-known cell cycle data sets of the budding yeast S. cerevisiae from Cho et al. and Tavazoie et al., on the breast cancer subtypes Basal A and Basal B, and on the leukemia data set of Golub et al.
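The two-way clustering idea in the abstract can be illustrated with a minimal serial sketch in Python. It is not the authors' MFCM code: it omits the message passing layer entirely, the expression values are made up, and every function name (`fuzzy_c_means`, `membership_row`, `update_centers`, `membership_entropy`, `hard_labels`) is hypothetical. The abstract does not define its gene entropy measure, so the Shannon entropy of a gene's membership row is used here only as a stand-in, and the deterministic farthest-point seeding is likewise an assumption made to keep the example reproducible.

```python
import math


def membership_row(row, centers, m):
    """Fuzzy memberships of one row with respect to every center (sums to 1)."""
    dist = [math.dist(row, c) for c in centers]
    if min(dist) == 0.0:
        # Row coincides with a center: share membership among the exact hits.
        hits = [1.0 if d == 0.0 else 0.0 for d in dist]
        return [h / sum(hits) for h in hits]
    inv = [d ** (-2.0 / (m - 1.0)) for d in dist]
    total = sum(inv)
    return [v / total for v in inv]


def update_centers(rows, memberships, n_clusters, m):
    """Each center is the mean of all rows weighted by membership ** m."""
    centers = []
    for k in range(n_clusters):
        weights = [u[k] ** m for u in memberships]
        w_sum = sum(weights)
        centers.append([sum(w * r[d] for w, r in zip(weights, rows)) / w_sum
                        for d in range(len(rows[0]))])
    return centers


def fuzzy_c_means(rows, n_clusters, m=2.0, max_iter=100, tol=1e-6):
    """Serial fuzzy c-means; returns (centers, memberships)."""
    # Assumption of this sketch: deterministic farthest-point seeding.
    centers = [list(rows[0])]
    while len(centers) < n_clusters:
        far = max(rows, key=lambda r: min(math.dist(r, c) for c in centers))
        centers.append(list(far))
    memberships = [membership_row(r, centers, m) for r in rows]
    for _ in range(max_iter):
        centers = update_centers(rows, memberships, n_clusters, m)
        updated = [membership_row(r, centers, m) for r in rows]
        shift = max(abs(a - b) for old, new in zip(memberships, updated)
                    for a, b in zip(old, new))
        memberships = updated
        if shift < tol:  # memberships have stopped changing
            break
    return centers, memberships


def membership_entropy(u):
    """Shannon entropy in bits of one membership row (0 = crisp assignment)."""
    return -sum(v * math.log2(v) for v in u if v > 0.0)


def hard_labels(memberships):
    """Index of the highest-membership cluster for each row."""
    return [max(range(len(u)), key=u.__getitem__) for u in memberships]


# Toy expression matrix (made-up values): rows are genes, columns are samples.
expr = [
    [1.0, 1.1, 5.0, 5.2],
    [0.9, 1.0, 5.1, 4.9],
    [5.0, 5.1, 1.0, 0.8],
    [5.2, 4.9, 1.1, 1.0],
]
samples = [list(col) for col in zip(*expr)]  # transpose to cluster columns

_, gene_u = fuzzy_c_means(expr, n_clusters=2)
_, sample_u = fuzzy_c_means(samples, n_clusters=2)
gene_labels = hard_labels(gene_u)
sample_labels = hard_labels(sample_u)

# Candidate biclusters: every (gene cluster, sample cluster) pair.
biclusters = {
    (g, s): ([i for i, lab in enumerate(gene_labels) if lab == g],
             [j for j, lab in enumerate(sample_labels) if lab == s])
    for g in set(gene_labels) for s in set(sample_labels)
}
# Stand-in filter score: low entropy means a confident cluster assignment.
gene_entropy = [membership_entropy(u) for u in gene_u]
```

Running fuzzy c-means once on the rows and once on the transposed matrix gives gene clusters and sample clusters, and each pair of the two defines a candidate bicluster that an entropy-based filter could then rank. In a message passing version, one plausible arrangement (an assumption, not something the abstract states) is for each process to hold a slice of the rows and for the weighted sums in `update_centers` to be combined across processes on every iteration.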


Bibliographic record

Source: International Journal of Computer Science and Security, 2015, Issue 5
Authors: Dwitiya Tyagi-Tiwari; Manoj Jha; Namita Srivastava; Sujoy Das
Original format: PDF
Chinese Library Classification: Automation technology; computer technology

Similar literature

1. Fine-grained parallelization of fitness functions in bioinformatics optimization problems: gene selection for cancer classification and biclustering of gene expression data [J]. Juan A. Gomez-Pulido, Jose L. Cerrada-Barrios, Sebastian Trinidad-Amado. BMC Bioinformatics. 2016, Issue 1.
2. Gene Expression Data Analysis Using a Novel Approach to Biclustering Combining Discrete and Continuous Data [J]. Christinat Y., Wachmann B., Lei Zhang. IEEE/ACM Transactions on Computational Biology and Bioinformatics. 2008, Issue 4.
3. DNA Microarray Data Analysis: A Novel Biclustering Algorithm Approach [J]. Alain B. Tchagang, Ahmed H. Tewfik. EURASIP Journal on Applied Signal Processing. 2006, Issue 16.
4. Biclustering of Gene Expression Microarray data using Dynamic deme Parallelized Genetic Algorithm (DdPGA) [C]. Shreya Mishra, Swati Vipsita. IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology. 2017.
5. Probe design and data analysis for gene expression microarrays [D]. Warren, Liling Li. 2003.
6. Fine-grained parallelization of fitness functions in bioinformatics optimization problems: gene selection for cancer classification and biclustering of gene expression data [O]. Juan A. Gomez-Pulido, Jose L. Cerrada-Barrios, Sebastian Trinidad-Amado. 2016.
7. Two-Way Clustering Analysis using Parallel Fuzzy Approach for Microarray Gene Expression Data [O]. Dwitiya Tyagi-Tiwari, Sujoy Das, Namita Srivastava. 2015.
