Discovery of bidirectional contiguous column coherent bicluster in time-series gene expression data

Yun Xue; Zhihao Ma; Huixin Xu; Zhihao Lu; Xiaohui Hu; Chaoyi Pang

首页> 外文期刊>International journal of machine learning and cybernetics >Discovery of bidirectional contiguous column coherent bicluster in time-series gene expression data

【24h】

Discovery of bidirectional contiguous column coherent bicluster in time-series gene expression data

机译：在时序基因表达数据中发现双向连续列相干双簇

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

AbstractThe application of high-throughput microarray has led to massive gene expression data, urging effective methodology for analysis. Biclustering comes out and serves as a useful tool, performing simultaneous clustering on rows and columns to find subsets of coherently expressed genes and conditions. Specially, in analysis of time–series gene expression data, it is meaningful to restrict biclusters to contiguous time points concerning coherent evolutions. In this paper, BCCC-Bicluster is proposed as an extension of CCC-Bicluster. An exact algorithm based on frequent sequential mining is proposed to find all maximal BCCC-Biclusters. The newly defined Frequent-Infrequent Tree-Array (FITA) is constructed to speed up the traversal process, with useful strategies originating from Apriori property to avoid redundant work. To make it more efficient, the bitwise operation XOR is applied to capture identical or opposite contiguous patterns between two rows. The algorithm is tested in simulated data, yeast microarray data and human microarray data. The experimental results show the proposed algorithm had better performance on the ability to recover the planted biclusters in the synthetic data than CCC-Biclusters and outperformed the one without FITA in speed and scalability. In the enrichment analysis, BCCC-Biclusters are proven to find more significant GO terms involved in biological processes than other three kinds of up-to-date biclusters.

机译： Abstract 高通量微阵列的应用催生了大量基因表达数据，敦促有效的分析方法。双聚类技术（biclustering）问世，它是一种有用的工具，可以对行和列进行同时聚类，以找到相干表达的基因和条件的子集。特别地，在分析时序基因表达数据时，将双聚簇限制在有关连贯进化的连续时间点上是有意义的。本文提出了BCCC-Bicluster作为CCC-Bicluster的扩展。提出了一种基于频繁顺序挖掘的精确算法，以找到所有最大的BCCC-Bicluster。新定义的频繁不频繁树数组（FITA）构造为加快遍历过程，其有用的策略源自Apriori属性，以避免重复工作。为了使其更有效，应用了按位运算XOR来捕获两行之间相同或相反的连续模式。该算法已在模拟数据，酵母微阵列数据和人类微阵列数据中进行了测试。实验结果表明，与CCC-Bicluster相比，该算法在恢复合成数据中的双峰方面具有更好的性能，在速度和可扩展性方面均优于无FITA的算法。在富集分析中，事实证明，BCCC-双簇比其他三种最新的双簇更能发现与生物过程有关的GO术语。

著录项

来源
《International journal of machine learning and cybernetics》 |2018年第3期|413-426|共14页
作者
Yun Xue; Zhihao Ma; Huixin Xu; Zhihao Lu; Xiaohui Hu; Chaoyi Pang;
展开▼
作者单位

Laboratory of Quantum Engineering and Quantum Materials, School of Physics and Telecommunication, South China Normal University;

Laboratory of Quantum Engineering and Quantum Materials, School of Physics and Telecommunication, South China Normal University;

BGI-Shenzhen;

Computer School, South China Normal University;

Laboratory of Quantum Engineering and Quantum Materials, School of Physics and Telecommunication, South China Normal University;

NIT, Zhejiang University;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Bicluster; Time series; Gene expression data; Coherent evolution; Frequent sequential mining; Bitwise operation;

机译：Bicluster;时间序列;基因表达数据;相干演化;频繁顺序挖掘;按位运算;

相似文献

外文文献
中文文献
专利

1. A contiguous column coherent evolution biclustering algorithm for time-series gene expression data [J] . Yun Xue, Meizhen Zhang, Zhengling Liao, International journal of machine learning and cybernetics . 2018,第3期

机译：时间序列基因表达数据的连续列相干进化双聚类算法
2. Leveraging additional knowledge to support coherent bicluster discovery in gene expression data [J] . Alessia Visconti, Francesca Cordero, Ruggero G. Pensa Intelligent data analysis . 2014,第5期

机译：利用其他知识来支持基因表达数据中的连贯双聚簇发现
3. Discovering maximal size coherent biclusters from gene expression data [J] . J. Bagyamani, K. Thangavel International Journal of Healthcare Technology and Management . 2011,第5a6期

机译：从基因表达数据中发现最大大小的连贯双簇
4. A biclustering algorithm with coherent evolution on the contiguous columns facing time-series gene data [C] . Xue Yun, Li Meihang, Liao Zhengling, International Conference on Fuzzy Systems and Knowledge Discovery . 2014

机译：面向时序基因数据的连续列上具有连贯进化的双聚类算法
5. Time-series Gene Expression Data: Models, Challenges, and Potentials [D] . Yao, Shun 2016

机译：时间序列基因表达数据：模型，挑战和潜力
6. Discovery of error-tolerant biclusters from noisy gene expression data [O] . Rohit Gupta, Navneet Rao, Vipin Kumar 2011

机译：从嘈杂的基因表达数据中发现容错双聚簇
7. Discovering coherent biclusters from gene expression data using zero-suppressed binary decision diagrams [O] . Sungroh Yoon, Christine Nardini, Luca Benini, 2014

机译：使用零抑制二元决策图从基因表达数据中发现连贯的双聚类

Discovery of bidirectional contiguous column coherent bicluster in time-series gene expression data

摘要

著录项

相似文献

相关主题

期刊订阅