首页> 外文期刊>Informatics in Medicine Unlocked >Identification of cancer related genes using feature selection and association rule mining
【24h】

Identification of cancer related genes using feature selection and association rule mining

机译:使用特征选择和关联规则挖掘鉴定癌症相关基因的鉴定

获取原文
           

摘要

High throughput sequencing generates large volumes of high dimensional data. Identifying informative features from the generated big data is always a challenge. Feature selection reduces complex data into a smaller number of variables while preserving the information as much as possible. In this study, we used DaMiRseq, DESeq2, edgeR and Limma?+?voom to identify differentially expressed genes in 79 small cell lung cancer (sclc) and 7 normal controls. A gene network was used to identify any coexpressed genes. Association rule mining was used to identify any association between connected genes in the network. Limma?+?voom identified the highest number of differentially expressed genes. However, 81 genes were common in the four differential gene expression analysis methods used. After filtering out all nodes with a degree less than 5, the final network had 43 nodes and 63 edges. Association rule mining on the coexpressed genes generated 263 rules. Genes that were common in the rules were: SLC34A2, CAV2, EPAS1, CTSH, AQP1 and LRRK2. These genes have been associated with various types of cancer. Therefore, feature selection using differential gene expression analysis, co-expression networks and association rule mining could help infer relationships among genes and their possibility of having a shared biological function.
机译:高吞吐量排序产生大量的高维数据。识别生成的大数据的信息功能始终是一个挑战。特征选择将复杂数据减少为较少数量的变量,同时尽可能保留信息。在这项研究中,我们使用了Damireseq,Deseq2,Edger和Limma?+?变大,以识别79个小细胞肺癌(SCLC)和7个正常对照中的差异表达基因。基因网络用于鉴定任何共同的基因。关联规则挖掘用于识别网络中连接基因之间的任何关联。 Limma?+?变量识别最多数量的差异表达基因。然而,在使用的四种差分基因表达分析方法中,81个基因常见。在过滤耗尽程度小于5的所有节点后,最终网络具有43个节点和63个边。关联规则挖掘在共同的基因上产生了263项规则。规则中常见的基因是:SLC34A2,CAV2,EPAS1,CTSH,AQP1和LRRK2。这些基因已与各种类型的癌症有关。因此,使用差异基因表达分析,共表达网络和关联规则挖掘的特征选择可以帮助推断基因之间的关系及其具有共同生物学功能的可能性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号