Frequent weighted itemset mining from gene expression data

机译：频繁加权替换项目从基因表达数据开采

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Gene Expression Datasets (GEDs) usually consist of the expression values of thousands of genes within hundreds of samples. Frequent itemset and association rule mining algorithms have been applied to discover significant co-expressions among multiple genes from GEDs. To perform these data analyses, gene expression values are commonly discretized into a predefined number of bins. Such an expert-driven and not trivial preprocessing step could bias the quality of the mining result. This paper presents a novel approach to discovering gene correlations from GEDs which does not require data discretization. By representing per-sample gene expression values as item weights, frequent weighted itemsets can be extracted. The discovery of weighted itemsets instead of traditional (not weighted) ones prevents experts from discretizing GEDs before analyzing them and thus improves the effectiveness of the knowledge discovery process. Experiments performed on real GEDs demonstrate the effectiveness of the proposed approach.

机译：基因表达数据集（GED）通常由数百个样品中数千个基因的表达值组成。频繁的项目集和关联规则挖掘算法已应用于发现来自GED的多种基因之间的显着联合表达。为了执行这些数据分析，基因表达值通常被离散地分成预定数量的垃圾箱。这样的专家驱动和不琐碎的预处理步骤可以偏离挖掘结果的质量。本文介绍了一种从未要求数据离散化的GED的基因相关性的新方法。通过将每个样本基因表达值表示为项目权重，可以提取频繁加权项集。对加权项目集的发现而不是传统（未加权）的项目，防止专家在分析之前离散，从而提高了知识发现过程的有效性。实验对实际GED进行的实验表明了所提出的方法的有效性。

著录项

来源
《IEEE International Conference on Bioinformatics and Bioengineering》|2013年||共4页
会议地点
作者
Baralis Elena; Cagliero Luca; Cerquitelli Tania; Chiusano Silvia;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类生物信息论;
关键词

相似文献

外文文献
中文文献
专利

1. Mining Weighted Frequent Itemsets without Candidate Generation in Uncertain Databases [J] . Lin Jerry Chun-Wei, Gan Wensheng, Fournier-Viger Philippe, International Journal of Information Technology & Decision Making . 2017,第6期

机译：未在不确定数据库中挖掘加权频繁项目集
2. An Efficient Method for Mining Frequent Weighted Closed Itemsets from Weighted Item Transaction Databases [J] . Bay Vo Journal of Information Recording . 2017,第1期

机译：一种从加权项目交易数据库中挖掘频繁的加权封闭项目集的有效方法
3. Efficient weighted probabilistic frequent itemset mining in uncertain databases [J] . Li Zhiyang, Chen Fengjuan, Wu Junfeng, Expert Systems . 2021,第5期

机译：在不确定数据库中有效的加权概率频繁漏洞挖掘
4. Frequent weighted itemset mining from gene expression data [C] . Baralis Elena, Cagliero Luca, Cerquitelli Tania, IEEE International Conference on Bioinformatics and Bioengineering . 2013

机译：从基因表达数据频繁加权项集挖掘
5. Mining Frequent Itemsets from Uncertain Data: Extensions to Constrained Mining and Stream Mining. [D] . Hao, Boyu. 2010

机译：从不确定的数据中挖掘频繁项集：约束挖掘和流挖掘的扩展。
6. Genetic Programming and Frequent Itemset Mining to Identify Feature Selection Patterns of iEEG and fMRI Epilepsy Data [O] . Otis Smart, Lauren Burrell -1

机译：遗传程序设计和频繁项集挖掘以识别iEEG和fMRI癫痫数据的特征选择模式
7. MBiS: an efficient method for mining frequent weighted utility itemsets from quantitative databases [O] . Nguyen Duy Ham, Võ Đình Bảy, Nguyen Thi Hong Minh, 2015

机译：MBIS：从定量数据库中挖掘频繁加权实用程序集合的有效方法

Frequent weighted itemset mining from gene expression data

摘要

著录项

相似文献

相关主题

期刊订阅