Classifying G-protein coupled receptors with support vector machines

Rachel Karchin; Kevin Karplus; David Haussler

Abstract

Motivation: The enormous amount of protein sequence data uncovered by genome research has increased the demand for computer software that can automate the recognition of new proteins. We discuss the relative merits of various automated methods for recognizing G-Protein Coupled Receptors (GPCRs), a superfamily of cell membrane proteins. GPCRs are found in a wide range of organisms and are central to a cellular signalling network that regulates many basic physiological processes. They are the focus of a significant amount of current pharmaceutical research because they play a key role in many diseases. However, their tertiary structures remain largely unsolved. The methods described in this paper use only primary sequence information to make their predictions. We compare a simple nearest neighbor approach (BLAST), methods based on multiple alignments generated by a statistical profile Hidden Markov Model (HMM), and methods, including Support Vector Machines (SVMs), that transform protein sequences into fixed-length feature vectors.

Results: The last is the most computationally expensive method, but our experiments show that, for those interested in annotation-quality classification, the results are worth the effort. In two-fold cross-validation experiments testing recognition of GPCR subfamilies that bind a specific ligand (such as a histamine molecule), the errors per sequence at the Minimum Error Point (MEP) were 13.7% for multi-class SVMs, 17.1% for our SVMtree method of hierarchical multi-class SVM classification, 25.5% for BLAST, 30% for profile HMMs, and 49% for Kernel Nearest Neighbor (kernNN) classification on feature vectors. The percentage of true positives recognized before the first false positive was 65% for both SVM methods, 13% for BLAST, 5% for profile HMMs, and 4% for kernNN.
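The two evaluation measures quoted in the results can be illustrated with a short sketch. It is not the authors' code. It assumes that each test sequence receives a real-valued score from a classifier, that higher scores mean "more likely a member of the subfamily", and that the Minimum Error Point is the score cutoff at which false positives plus false negatives is smallest; the abstract does not define the MEP, so that definition is an assumption here. All function names are hypothetical.

```python
from typing import List, Tuple


def rank_by_score(scores: List[float], labels: List[bool]) -> List[bool]:
    """Return the true labels ordered from highest to lowest classifier score."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return [labels[i] for i in order]


def true_positives_before_first_false_positive(ranked: List[bool]) -> float:
    """Fraction of all positives that rank above the top-scoring negative."""
    total_pos = sum(ranked)
    if total_pos == 0:
        return 0.0
    count = 0
    for is_pos in ranked:
        if not is_pos:
            break
        count += 1
    return count / total_pos


def minimum_error_point(ranked: List[bool]) -> Tuple[int, int]:
    """Return (cutoff, errors) where accepting the top `cutoff` items
    as positives minimizes false positives + false negatives.
    Assumed definition of the MEP; ties keep the smallest cutoff."""
    total_pos = sum(ranked)
    # Cutoff 0 rejects everything, so every positive is a false negative.
    best_cutoff, best_errors = 0, total_pos
    tp = fp = 0
    for k, is_pos in enumerate(ranked, start=1):
        if is_pos:
            tp += 1
        else:
            fp += 1
        errors = fp + (total_pos - tp)
        if errors < best_errors:
            best_cutoff, best_errors = k, errors
    return best_cutoff, best_errors


# Toy data (made up for illustration, not taken from the paper).
scores = [0.4, 0.9, 0.7, 0.8, 0.5, 0.6]
labels = [False, True, False, True, False, True]
ranked = rank_by_score(scores, labels)
print(true_positives_before_first_false_positive(ranked))
print(minimum_error_point(ranked))
```

How the error count at the cutoff is normalized into the "errors per sequence" figures reported above is not stated in the abstract, so the sketch returns the raw count only.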


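Bibliographic details

Source: Bioinformatics, 2002, Issue 1, 13 pages
Authors: Rachel Karchin; Kevin Karplus; David Haussler
Original format: PDF
Language of text: English
Chinese Library Classification: Biological sciences; Bioengineering (biotechnology)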

