Gene Selection Using Information Theory and Statistical Approach

Kaberi Das; Jagannath Ray; Debahuti Mishra

首页> 外文期刊>Indian Journal of Science and Technology >Gene Selection Using Information Theory and Statistical Approach

【24h】

Gene Selection Using Information Theory and Statistical Approach

机译：利用信息论和统计方法进行基因选择

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper focuses on a methodological framework for gene selection by two approaches such as statistical approach and information based approach. Statistical measures are univariate measures where the gene relevance score of each gene is calculated without considering its co-relation (positive co-relation or negative co-relation) with other genes. Statistical approach includes Euclidian distance and Pearson co-relation. Mutual information is the measure of mutual dependence between two random variables in the case of probability theory. Information based approach includes information gain and dynamic relevance. In this paper the above gene selection methods are applied on four publicly available data sets such as, breast cancer, leukemia, hepatitis and dermatology to generate the subset of genes. Then, the resultant subset is fed through two classifiers namely Naive-Bayes and Support Vector Machine (SVM). Here also the data sets are directly applied to the classifier without applying the gene selection methods. Finally when we have compared the result, it has been found that all the data sets showing better accuracy when the classifiers are applied after gene selection technique which reflects the importance of gene selection technique.

机译：本文着重于通过两种方法（例如统计方法和基于信息的方法）进行基因选择的方法框架。统计量度是单变量量度，其中在计算每个基因的基因相关性评分时不考虑其与其他基因的正相关（正相关或负相关）。统计方法包括欧几里得距离和皮尔逊相关。在概率论的情况下，互信息是两个随机变量之间相互依赖的度量。基于信息的方法包括信息获取和动态相关性。在本文中，上述基因选择方法应用于四个公开可用的数据集，例如乳腺癌，白血病，肝炎和皮肤病学，以生成基因子集。然后，将所得子集通过两个分类器（即朴素贝叶斯和支持向量机（SVM））进行馈送。在这里，数据集也直接应用于分类器，而无需应用基因选择方法。最后，当我们比较结果时，发现在基因选择技术之后应用分类器时，所有数据集显示出更好的准确性，这反映了基因选择技术的重要性。

著录项

来源
《Indian Journal of Science and Technology》 |2015年第8期|共7页
作者
Kaberi Das; Jagannath Ray; Debahuti Mishra;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类连续性出版物;
关键词

相似文献

外文文献
中文文献
专利

1. Gene Selection Using Information Theory and Statistical Approach [J] . Kaberi Das, Jagannath Ray, Debahuti Mishra Indian Journal of Science and Technology . 2015,第8期

机译：利用信息论和统计方法进行基因选择
2. Statistical Homogeneous Cluster Spectroscopy (SHOCSY): An Optimized Statistical Approach for Clustering of ~1H NMR Spectral Data to Reduce Interference and Enhance Robust Biomarkers Selection [J] . Xin Zou, Elaine Holmes, Jeremy K. Nicholson, Analytical chemistry . 2014,第11期

机译：统计同质团簇光谱（SHOCSY）：〜1H NMR光谱数据聚类的优化统计方法，以减少干扰并增强鲁棒的生物标记物选择
3. A Quantitative Approach to Calculating the Energetic Heterogeneity of Solid Surfaces from an Analysis of TPD Peaks: Comparison of the Results Obtained Using the Absolute Rate Theory and the Statistical Rate Theory of Interfacial Transport [J] . Wladyslaw Rudzinski, Tadeusz Borowiecki, Tomasz Panczyk, The journal of physical chemistry, B. Condensed matter, materials, surfaces, interfaces & biophysical . 2000,第9期

机译：通过TPD峰分析计算固体表面的能量异质性的定量方法：使用绝对速率理论和界面迁移统计速率理论获得的结果的比较
4. An Approach to Feature Selection Based on Fuzzy Clustering and Statistic Theory [C] . Gao Xinbo, Ji Hongbing, Xie Weixin International Conference on Electronic Measuement Instruments . 1999

机译：基于模糊聚类和统计理论的特征选择方法
5. Two topics: A jackknife maximum likelihood approach to statistical model selection, and, A convex hull peeling depth approach to nonparametric massive multivariate data analysis with applications. [D] . Lee, Hyunsook. 2006

机译：两个主题：用于统计模型选择的折刀最大似然方法，以及用于非参数大规模多元数据分析的凸壳剥离深度方法及其应用。
6. Statistical HOmogeneous Cluster SpectroscopY (SHOCSY):An Optimized Statistical Approach for Clustering of 1H NMR Spectral Data to ReduceInterference and Enhance Robust Biomarkers Selection [O] . Xin Zou, Elaine Holmes, Jeremy K. Nicholson, -1

机译：统计同质团簇光谱（SHOCSY）：1H NMR光谱数据聚类以减少的最佳统计方法干扰并增强健壮的生物标志物选择
7. Statistical HOmogeneous Cluster SpectroscopY (SHOCSY): an optimized statistical approach for clustering of ¹H NMR spectral data to reduce interference and enhance robust biomarkers selection. [O] . Zou Xin, Holmes Elaine, Nicholson Jeremy K., 2014

机译：统计同质聚类光谱法（SHOCSY）：一种优化的统计方法，用于对1 H NMR光谱数据进行聚类，以减少干扰并增强可靠的生物标记物选择。
8. PROBABILITY AND STATISTICS IN ITEM ANALYSIS AND CLASSIFICATION PROBLEMS Statistical Decision Theory Approach to Item Selection for Dichotomous Test and Criterion Variables [R] . HOWARD RAIFFA 1957

机译：项目分析和分类问题的概率和统计量二分法和标准变量的项目选择的统计决策理论方法

Gene Selection Using Information Theory and Statistical Approach

摘要

著录项

相似文献

相关主题

期刊订阅