Data Mining of Pancreatic Cancer Protein Databases

机译：胰腺癌蛋白质数据库的数据挖掘

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Data mining of protein databases poses special challenges because many protein databases are non-relational whereas most data mining and machine learning algorithms assume the input data to be a type of relational database that is also representable as an ARFF file. We developed a method to restructure protein databases so that they become amenable for various data mining and machine learning tools. Our restructuring method enabled us to apply both decision tree and support vector machine classifiers to a pancreatic protein database. The SVM classifier that used both GO term and PFAM families to characterize proteins gave us over 73% accuracy in predicting whether a protein is involved in pancreatic cancer.

机译：蛋白质数据库的数据挖掘构成了特殊挑战，因为许多蛋白质数据库是非关系，而大多数数据挖掘和机器学习算法假设输入数据是一种类型的关系数据库，也可以作为ARFF文件表示。我们开发了一种重构蛋白质数据库的方法，以便它们对各种数据挖掘和机器学习工具进行适用。我们的重组方法使我们能够应用两个决策树并支持向胰蛋白质数据库的向量机分类器。使用术语和PFAM系列表征蛋白质的SVM分类器使我们在预测蛋白质涉及胰腺癌是否参与其中超过73％的精度。

著录项

来源
《WSEAS International Conference on Environment, Ecosystems and Development》|2013年||共6页
会议地点
作者
PETER Z. REVESZ; CHRISTOPHER ASSI;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP3-53;
关键词
Pancreatic cancer; Proteins; GO terms; PFAM families; Data mining; Decision trees; Support vector machines;

机译：胰腺癌;蛋白质;GO条款;PFAM系列;数据挖掘;决定树;支持矢量机器;

相似文献

外文文献
中文文献
专利

1. Pancreatic Expression database: a generic model for the organization, integration and mining of complex cancer datasets [J] . Claude Chelala, Stephan A Hahn, Hannah J Whiteman, BMC Genomics . 2007,第1期

机译：胰腺表达数据库：用于组织，整合和挖掘复杂癌症数据集的通用模型
2. Association between Statin Use and Cancer: Data Mining of a Spontaneous Reporting Database and a Claims Database [J] . Mai Fujimoto, Tomoya Higuchi, Kouichi Hosomi, International Journal of Medical Sciences . 2015,第3期

机译：他汀类药物使用与癌症之间的关联：自发报告数据库和索赔数据库的数据挖掘
3. Consensus data mining (CDM) protein secondary structure prediction server: Combining GOR v and fragment database mining (FDM) [J] . Cheng HT, Sen TZ, Jernigan RL, Bioinformatics . 2007,第19期

机译：共识数据挖掘（CDM）蛋白二级结构预测服务器：结合GOR v和片段数据库挖掘（FDM）
4. Data Mining of Pancreatic Cancer Protein Databases [C] . PETER Z. REVESZ, CHRISTOPHER ASSI WSEAS International Conference on Environment, Ecosystems and Development . 2013

机译：胰腺癌蛋白质数据库的数据挖掘
5. Data mining in databases: An extended decision tree approach and methodology in database environment. [D] . Iliskovic, Sinisa A. 2000

机译：数据库中的数据挖掘：数据库环境中的扩展决策树方法和方法。
6. Pancreatic Expression database: a generic model for the organization integration and mining of complex cancer datasets [O] . Claude Chelala, Stephan A Hahn, Hannah J Whiteman, 2007

机译：胰腺表达数据库：用于组织整合和挖掘复杂癌症数据集的通用模型
7. Pancreatic Expression database: a generic model for the organization, integration and mining of complex cancer datasets [O] . Claude Chelala, Stephan A Hahn, Hannah J Whiteman, 2007

机译：胰腺表达数据库：用于组织，整合和挖掘复杂癌症数据集的通用模型

Data Mining of Pancreatic Cancer Protein Databases

摘要

著录项

相似文献

相关主题

期刊订阅