...
首页> 外文期刊>BMC Bioinformatics >Frequent contiguous pattern mining over biological sequences of protein misfolded diseases
【24h】

Frequent contiguous pattern mining over biological sequences of protein misfolded diseases

机译:频繁的蛋白质蛋白质生物序列的常见模式挖掘

获取原文

摘要

Proteins are integral part of all living beings, which are building blocks of many amino acids. To be functionally active, amino acids chain folds up in a complex way to give each protein a unique 3D shape, where a minor error may cause misfolded structure. Genetic disorder diseases i.e. Alzheimer, Parkinson, etc. arise due to misfolding in protein sequences. Thus, identifying patterns of amino acids is important for inferring protein associated genetic diseases. Recent studies in predicting amino acids patterns focused on only simple protein misfolded disease i.e. Chromaffin Tumor, by association rule mining. However, more complex diseases are yet to be attempted. Moreover, association rules obtained by these studies were not verified by usefulness measuring tools. In this work, we analyzed protein sequences associated with complex protein misfolded diseases (i.e. Sickle Cell Anemia, Breast Cancer, Cystic Fibrosis, Nephrogenic Diabetes Insipidus, and Retinitis Pigmentosa 4) by association rule mining technique and objective interestingness measuring tools. Experimental results show the effectiveness of our method. Adopting quantitative experimental methods, this work can form more reliable, useful and strong association rules i. e. dominating patterns of amino acid of complex protein misfolded diseases. Thus, in addition to usual applications, the identified patterns can be more useful in discovering medicines for protein misfolded diseases and thereby may open up new opportunities in medical science to handle genetic disorder diseases.
机译:蛋白质是所有生物的组成部分,这是许多氨基酸的构建块。在功能上活性,氨基酸链以复杂的方式折叠,以给出每个蛋白质是唯一的3D形状,其中次要误差可能导致错误折叠的结构。由于蛋白质序列中的错误折叠,因此出现了遗传障碍疾病。出现阿尔茨海默,帕金森等。因此,鉴定氨基酸的模式对于推断蛋白质相关的遗传疾病是重要的。最近在预测氨基酸图案的研究中仅针对简单的蛋白质错误折叠的疾病,即嗜铬肿瘤,通过关联规则开采。但是,尚未尝试更复杂的疾病。此外,通过有用的测量工具没有验证这些研究所获得的关联规则。在这项工作中,通过关联规则采矿技术和客观有趣测量工具,我们分析了与复杂蛋白质错误折叠疾病(即镰状细胞贫血,乳腺癌,囊性纤维化,肾病性糖尿病4)的蛋白质序列进行了分析(即镰状细胞贫血,乳腺癌,囊性纤维化,肾脏炎,肾脏炎。实验结果表明了我们方法的有效性。采用定量实验方法,这项工作可以形成更可靠,有用和强大的关联规则我。 e。复合蛋白质错误折叠疾病的氨基酸的主导模式。因此,除了通常的应用外,鉴定的模式在发现蛋白质错误折叠疾病的药物方面更有用,从而可以开辟医学科学的新机遇来处理遗传疾病疾病。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号