首页> 美国卫生研究院文献>Elsevier Public Health Emergency Collection >Some remarks on protein attribute prediction and pseudo amino acid composition
【2h】

Some remarks on protein attribute prediction and pseudo amino acid composition

机译:关于蛋白质属性预测和伪氨基酸组成的一些评论

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

With the accomplishment of human genome sequencing, the number of sequence-known proteins has increased explosively. In contrast, the pace is much slower in determining their biological attributes. As a consequence, the gap between sequence-known proteins and attribute-known proteins has become increasingly large. The unbalanced situation, which has critically limited our ability to timely utilize the newly discovered proteins for basic research and drug development, has called for developing computational methods or high-throughput automated tools for fast and reliably identifying various attributes of uncharacterized proteins based on their sequence information alone. Actually, during the last two decades or so, many methods in this regard have been established in hope to bridge such a gap. In the course of developing these methods, the following things were often needed to consider: (1) benchmark dataset construction, (2) protein sample formulation, (3) operating algorithm (or engine), (4) anticipated accuracy, and (5) web-server establishment. In this review, we are to discuss each of the five procedures, with a special focus on the introduction of pseudo amino acid composition (PseAAC), its different modes and applications as well as its recent development, particularly in how to use the general formulation of PseAAC to reflect the core and essential features that are deeply hidden in complicated protein sequences.
机译:随着人类基因组测序的完成,已知序列的蛋白质数量激增。相反,确定其生物学特性的步伐要慢得多。结果,序列已知蛋白和属性已知蛋白之间的差距变得越来越大。这种不平衡的状况严重限制了我们及时利用新发现的蛋白质进行基础研究和药物开发的能力,它呼吁开发计算方法或高通​​量自动化工具,以根据其序列快速可靠地鉴定未鉴定蛋白质的各种属性。仅信息。实际上,在过去的二十年左右的时间里,已经建立了许多这方面的方法,以期弥合这种差距。在开发这些方法的过程中,通常需要考虑以下事项:(1)基准数据集的构建;(2)蛋白质样品的配方;(3)操作算法(或引擎);(4)预期的准确性;以及(5) )网络服务器的建立。在这篇综述中,我们将讨论这五个程序中的每一个,特别着重介绍伪氨基酸成分(PseAAC)的引入,其不同的模式和应用以及其最近的发展,特别是在如何使用通用配方方面反映了复杂蛋白质序列中深深隐藏的核心和必要特征。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号