The predictive power of the CluSTr database

Petryszak R; Kretschmann E; Wieser D; Apweiler R

首页> 外文期刊>Bioinformatics >The predictive power of the CluSTr database

【24h】

The predictive power of the CluSTr database

机译：CluSTr数据库的预测能力

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The CluSTr database employs a fully automatic single-linkage hierarchical clustering method based on a similarity matrix. In order to compute the matrix, first all-against-all pair-wise comparisons between protein sequences are computed using the Smith-Waterman algorithm. The statistical significance of the similarity scores is then assessed using a Monte Carlo analysis, yielding Z-values, which are used to populate the matrix. This paper describes automated annotation experiments that quantify the predictive power and hence the biological relevance of the CluSTr data. The experiments utilized the UniProt data-mining framework to derive annotation predictions using combinations of InterPro and CluSTr. We show that this combination of data sources greatly increases the precision of predictions made by the data-mining framework, compared with the use of InterPro data alone. We conclude that the CluSTr approach to clustering proteins makes a valuable contribution to traditional protein classifications.

机译：CluSTr数据库采用基于相似度矩阵的全自动单链接分层聚类方法。为了计算矩阵，首先使用Smith-Waterman算法计算蛋白质序列之间的所有对所有对。然后使用蒙特卡洛分析评估相似性得分的统计显着性，得出Z值，该Z值用于填充矩阵。本文介绍了自动注释实验，该实验对CluSTr数据的预测能力以及生物学相关性进行了量化。实验利用UniProt数据挖掘框架结合使用InterPro和CluSTr来获得注释预测。我们证明，与单独使用InterPro数据相比，这种数据源组合极大地提高了数据挖掘框架进行预测的准确性。我们得出结论，CluSTr对蛋白质进行聚类的方法对传统的蛋白质分类做出了宝贵的贡献。

著录项

来源
《Bioinformatics》 |2005年第18期|共6页
作者
Petryszak R; Kretschmann E; Wieser D; Apweiler R;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类生物科学;
关键词
Z-VALUE STATISTICS; PROTEIN SEQUENCES; ALIGNMENTS; ALGORITHM; FAMILIES;

机译：Z值统计;蛋白质序列;对齐;算法;家族;

相似文献

外文文献
中文文献
专利

1. The predictive power of the CluSTr database [J] . Petryszak R, Kretschmann E, Wieser D, Bioinformatics . 2005,第18期

机译：CluSTr数据库的预测能力
2. On the Predictive Power of Database Classifiers Formed by a Small Network of Interacting Chemical Oscillators [J] . Zommer Ludomir, Gizynski Konrad, Gorecki Jerzy International journal of unconventional computing . 2019,第2期

机译：小型化学相互作用器网络构成的数据库分类器的预测能力
3. Is lymphovascular invasion a powerful predictor for biochemical recurrence in pT3 N0 prostate cancer? Results from the K-CaP database [J] . Yong Hyun Park, Yejin Kim, Hwanjo Yu, Scientific reports. . 2016,第1期

机译：淋巴管侵犯是否是pT3 N0前列腺癌生化复发的有力预测因子？ K-CaP数据库的结果
4. IDPredictor: Predict Database Links in Biomedical Database [C] . Hendrick Mehlhorn, Matthias Lange, Uwe Scholz, International symposium on integrative bioinformatics, 8th annual meeting . 2012

机译：IDPredictor：预测生物医学数据库中的数据库链接
5. Predicting Minimum Control Speed on the Ground (VMCG) and Minimum Control Airspeed (VMCA) of Engine Inoperative Flight Using Aerodynamic Database and Propulsion Database Generators [D] . Hadder, Eric Michael 2016

机译：使用空气动力学数据库和推进数据库生成器预测发动机不工作飞行的地面最小控制速度（VMCG）和最小控制空速（VMCA）
6. Is lymphovascular invasion a powerful predictor for biochemical recurrence in pT3 N0 prostate cancer? Results from the K-CaP database [O] . Yong Hyun Park, Yejin Kim, Hwanjo Yu, -1

机译：淋巴管侵犯是否是pT3 N0前列腺癌生化复发的有力预测因子？ K-CaP数据库的结果
7. Sustainability of the Reanalysis Databases in Predicting the Wind and Wave Power along the European Coasts [O] . Florin Onea, Eugen Rusu 2018

机译：可再分析数据库在预测欧洲沿海风能和波浪能方面的可持续性

The predictive power of the CluSTr database

摘要

著录项

相似文献

相关主题

期刊订阅