...
首页> 外文期刊>Protein and peptide letters >TRAINER: A General-Purpose Trainable Short Biosequence Classifer
【24h】

TRAINER: A General-Purpose Trainable Short Biosequence Classifer

机译:培训师:通用的可训练短生物序列分类器

获取原文
获取原文并翻译 | 示例
           

摘要

Classifying sequences is one of the central problems in computational biosciences. Several tools have been released to map an unknown molecular entity to one of the known classes using solely its sequence data. However, all of the existing tools are problem-specific and restricted to an alphabet constrained by relevant biological structure. Here, we introduce TRAINER, a new online tool designed to serve as a generic sequence classification platform to enable users provide their own training data with any alphabet therein defined. TRAINER allows users to select among several feature representation schemes and supervised machine learning methods with relevant parameters. Trained models can be saved for future use without retraining by other users. Two case studies are reported for effective use of the system for DNA and protein sequences; candidate effector prediction and nucleolar localization signal prediction. Biological relevance of the results is discussed.
机译:对序列进行分类是计算生物科学中的核心问题之一。已经发布了几种工具,仅使用其序列数据即可将未知分子实体映射到已知类别之一。但是,所有现有工具都是特定于问题的,并且限于受相关生物学结构约束的字母。在这里,我们介绍了TRAINER,这是一种新的在线工具,旨在用作通用序列分类平台,使用户能够提供自己的训练数据以及其中定义的任何字母。 TRAINER允许用户在几种特征表示方案和具有相关参数的监督机器学习方法中进行选择。可以将经过训练的模型保存起来以备将来使用,而无需其他用户重新训练。据报道,有两个案例研究有效地利用了该系统的DNA和蛋白质序列。候选效应子预测和核仁定位信号预测。讨论结果的生物学相关性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号