Scalable Information Gain Variant on Spark Cluster for Rapid Quantification of Microarray

Ransingh Biswajit Ray; Mukesh Kumar; Anand Tirkey; Santanu Kumar Rath

首页> 外文期刊>Procedia Computer Science >Scalable Information Gain Variant on Spark Cluster for Rapid Quantification of Microarray

【24h】

Scalable Information Gain Variant on Spark Cluster for Rapid Quantification of Microarray

机译：用于快速量化微阵列的Spark簇上的可扩展信息增益变量

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Microarray technology is one of the emerging technologies in the field of genetic research, which many researchers often use to monitor expression levels of genes in a given organism. Microarray experiments have wide range of applications in health care sector. The colossal amount of raw gene expression data often leads to computational and analytical challenges including feature selection and classification of the dataset into correct group or class. In this paper, mutual information feature selection method based on spark framework (sf-MIFS) is proposed to determine the pertinent features. After completion of feature selection process, various classifiers i.e., Logistic Regression (sf-LoR) and Naive Bayes (sf-NB) based on Spark framework has been applied to classify the microarray datasets. A detailed comparative analysis in terms of execution time and accuracy is enumerated on the proposed feature selection and classifier methodologies, based on Spark framework and conventional system respectively.

机译：微阵列技术是遗传研究领域中的新兴技术之一，许多研究人员经常使用微阵列技术来监测给定生物体中基因的表达水平。微阵列实验在医疗保健领域具有广泛的应用。大量原始基因表达数据通常会导致计算和分析难题，包括特征选择和将数据集分类为正确的组或类。提出了一种基于Spark框架的互信息特征选择方法（sf-MIFS），用于确定相关特征。在完成特征选择过程之后，基于Spark框架的各种分类器，即逻辑回归（sf-LoR）和朴素贝叶斯（sf-NB）已经被用于对微阵列数据集进行分类。分别基于Spark框架和常规系统，对提出的特征选择和分类器方法进行了详细的比较分析，分析了执行时间和准确性。

著录项

来源
《Procedia Computer Science》 |2016年第1期|共7页
作者
Ransingh Biswajit Ray; Mukesh Kumar; Anand Tirkey; Santanu Kumar Rath;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Rapid quantification of bioaerosols containing L pneumophila by Coriolis~R μ air sampler and chemiluminescence antibody microarrays [J] . Veronika Langer, Georg Hartmann, Reinhard Niessner Journal of Aerosol Science . 2012,第Null期

机译：通过Coriolis〜Rμ空气采样器和化学发光抗体微阵列快速定量测定含有嗜肺乳杆菌的生物气溶胶
2. Spark-IDPP: high-throughput and scalable prediction of intrinsically disordered protein regions with Spark clusters on the Cloud [J] . Malysiak-Mrozek Bozena, Baron Tomasz, Mrozek Dariusz Cluster computing . 2019,第2期

机译：Spark-IDPP：高通量和可扩展的云层上有着火花簇的内部无序蛋白质区的可扩展预测
3. Rapid Nanogram Scale Screening Method of Microarrays to Evaluate Drug-Polymer Blends Using High-Throughput Printing Technology [J] . Taresco Vincenzo, Louzao Iria, Scurr David, Molecular pharmaceutics . 2017,第6期

机译：使用高通量印刷技术进行微阵列评估药物 - 聚合物共混物的快速纳米尺度筛选方法
4. Rapid large-scale oligonucleotide selection for microarrays [C] . Rahmann, S. . 2002

机译：快速大规模选择寡核苷酸用于微阵列
5. Quantification of gene expressions from microarray images using fuzzy clustering. [D] . Gunampally, Maheswar Reddy. 2006

机译：使用模糊聚类从微阵列图像定量基因表达。
6. Genome-scale cluster analysis of replicated microarrays using shrinkage correlation coefficient [O] . Jianchao Yao, Chunqi Chang, Mari L Salmi, 2008

机译：使用收缩相关系数的复制微阵列的基因组规模聚类分析
7. Scalable Information Gain Variant on Spark Cluster for Rapid Quantification of Microarray [O] . Ray Ransingh Biswajit, Kumar Mukesh, Tirkey Anand, 2016

机译：用于快速量化微阵列的Spark簇上的可扩展信息增益变量

Scalable Information Gain Variant on Spark Cluster for Rapid Quantification of Microarray

摘要

著录项

相似文献

相关主题

期刊订阅