Mathematical Basis of Predicting Dominant Function in Protein Sequences by a Generic HMM–ANN Algorithm

Siddhartha Kundu

首页> 外文期刊>Acta Biotheoretica >Mathematical Basis of Predicting Dominant Function in Protein Sequences by a Generic HMM–ANN Algorithm

【24h】

Mathematical Basis of Predicting Dominant Function in Protein Sequences by a Generic HMM–ANN Algorithm

机译：通用HMM-ANN算法预测蛋白质序列中显性功能的数学依据

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The accurate annotation of an unknown protein sequence depends on extant data of template sequences. This could be empirical or sets of reference sequences, and provides an exhaustive pool of probable functions. Individual methods of predicting dominant function possess shortcomings such as varying degrees of inter-sequence redundancy, arbitrary domain inclusion thresholds, heterogeneous parameterization protocols, and ill-conditioned input channels. Here, I present a rigorous theoretical derivation of various steps of a generic algorithm that integrates and utilizes several statistical methods to predict the dominant function in unknown protein sequences. The accompanying mathematical proofs, interval definitions, analysis, and numerical computations presented are meant to offer insights not only into the specificity and accuracy of predictions, but also provide details of the operatic mechanisms involved in the integration and its ensuing rigor. The algorithm uses numerically modified raw hidden markov model scores of well defined sets of training sequences and clusters them on the basis of known function. The results are then fed into an artificial neural network, the predictions of which can be refined using the available data. This pipeline is trained recursively and can be used to discern the dominant principal function, and thereby, annotate an unknown protein sequence. Whilst, the approach is complex, the specificity of the final predictions can benefit laboratory workers design their experiments with greater confidence.

机译：未知蛋白质序列的准确注释取决于模板序列的远端数据。这可以是经验的或参考序列集，并且提供了一个有可能功能的详尽池。预测主导函数的单个方法具有缺点，例如不同程度的序列间冗余，任意域包含阈值，异构参数化协议和不良输入通道。这里，我介绍了一般算法的各个步骤的严格理论衍生，其集成并利用了几种统计方法来预测未知蛋白质序列中的显性函数。所提供的伴随的数学证据，间隔定义，分析和数值计算旨在提供不仅进入预测的特殊性和准确性的见解，而且还提供了整合中所涉及的操作机制的细节及其随后的严格。该算法使用数值修改的RAW隐马尔可夫Model模型分数的良好定义的训练序列集，并在已知功能的基础上群集它们。然后将结果馈入人工神经网络，其预测可以使用可用数据来改进。该管道经过递归培训，可用于辨别主导的主函数，从而诠释了未知的蛋白质序列。虽然，这种方法很复杂，但最终预测的特殊性可以受益实验室工作人员以更大的信心设计他们的实验。

著录项

来源
《Acta Biotheoretica》 |2018年第2期|共14页
作者
Siddhartha Kundu;
展开▼
作者单位

Department of Biochemistry Dr. Baba Saheb Ambedkar Medical College and Hospital Government of NCT of Delhi;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类生物科学;
关键词
Algorithm; Artificial neural network; Dominant protein function; Hidden markov model; Subfamily;

机译：算法;人工神经网络;主导蛋白质功能;隐藏的马尔可夫模型;亚家族;

相似文献

外文文献
中文文献
专利

1. Mathematical Basis of Predicting Dominant Function in Protein Sequences by a Generic HMM-ANN Algorithm (vol 66, pg 135, 2018) [J] . Kundu Siddhartha Acta Biotheoretica . 2020,第3期

机译：通用HMM-ANN算法预测蛋白质序列中显性功能的数学依据（Vol 66，PG 135,2018）
2. Mathematical basis of improved protein subfamily classification by a HMM-based sequence filter [J] . Kundu Siddhartha Mathematical Biosciences: An International Journal . 2017,第期

机译：基于HMM的序列滤波器改进蛋白质亚家族分类的数学依据
3. Algorithm for predicting functionally equivalent proteins from BLAST and HMMER searches [J] . Yu D.S., Lee D.-H., Kim S.K., Journal of microbiology and biotechnology . 2012,第8期

机译：通过BLAST和HMMER搜索预测功能等同蛋白的算法
4. Radial basis function neural network optimized by a genetic algorithm for soybean protein sequence residue spatial distance prediction [C] . Guang-Zheng Zhang, De-Shuang Huang Evolutionary Computation, 2004. CEC2004. Congress on . 2004

机译：遗传算法优化的径向基函数神经网络用于大豆蛋白质序列残基空间距离预测
5. Predicting protein function using sequence derived features selected by genetic algorithms. [D] . Kernytsky, Andrew. 2008

机译：使用遗传算法选择的序列衍生特征预测蛋白质功能。
6. Mathematical Basis of Predicting Dominant Function in Protein Sequences by a Generic HMM–ANN Algorithm [O] . Siddhartha Kundu -1

机译：通用HMM–ANN算法预测蛋白质序列中显性功能的数学基础
7. Correction to: Mathematical Basis of Predicting Dominant Function in Protein Sequences by a Generic HMM–ANN Algorithm [O] . Siddhartha Kundu 2020

机译：校正：通用HMM-ANN算法预测蛋白质序列中显性函数的数学依据

Mathematical Basis of Predicting Dominant Function in Protein Sequences by a Generic HMM–ANN Algorithm

摘要

著录项

相似文献

相关主题

期刊订阅