...
首页> 外文期刊>Indian Journal of Science and Technology >A Hybridized Clustering Approach based on Rough Set and Fuzzy c-Means to Mine Cholesterol Sequence from ABC Family
【24h】

A Hybridized Clustering Approach based on Rough Set and Fuzzy c-Means to Mine Cholesterol Sequence from ABC Family

机译:基于粗糙集和模糊c均值的矿山胆固醇序列混合聚类方法

获取原文
           

摘要

Objectives: The current study is focused on design of a computational model for human ABC transporters; wherein the TM-sequences matching the CRAC/CARC motif are extracted. Methods: The postulation of cholesterol binding motif (CRAC/CARC), its presence in different proteins and validating its interaction with cholesterol has indeed established the importance of the motif in cholesterol-mediated modulation of protein/signaling pathway. Several viral proteins and membrane proteins (especially alpha-helical trans membrane proteins) such as GPCR transporters are reported to be modulated by cholesterol. The experimental studies are so far performed on only a few proteins in a family but based on an evolutionary conservation and consensus an exploration can be done confidently within a family. However, the representation of motif has a low consensus yielding several false positives thus reducing its reliability. Findings: A computational hybrid clustering method based on rough set with fuzzy c-means algorithm is used to mine the cholesterol sequence from ABC family. Higher weightage is given to those sequences based on the following parameters: motifs with more number of sub motifs, number of helices bearing the motif in a protein and compliance with the orientation of the cholesterol in the membrane for its interaction with the motif. Improvement: A detailed study in a given super family with an approach to reduce redundancy and enrichment can improve its predictability.
机译:目的:目前的研究集中在设计人类ABC转运蛋白的计算模型。其中提取与CRAC / CARC基序匹配的TM序列。方法:胆固醇结合基序(CRAC / CARC)的假定,其在不同蛋白质中的存在以及验证其与胆固醇的相互作用确实确定了该基序在胆固醇介导的蛋白质/信号通路调节中的重要性。据报道,几种病毒蛋白和膜蛋白(特别是α-螺旋跨膜蛋白),例如GPCR转运蛋白,都受到胆固醇的调节。迄今为止,仅对家族中的几种蛋白质进行了实验研究,但是基于进化的保守性和共识,可以自信地在家族中进行探索。但是,主题表示形式的共识度很低,会产生一些误报,从而降低了其可靠性。结果:基于粗糙集的模糊c-均值算法的计算混合聚类方法被用于挖掘ABC家族的胆固醇序列。基于以下参数,对那些序列赋予较高的权重:具有更多亚基序的基序,在蛋白质中带有基序的螺旋数量以及与膜中胆固醇与基序相互作用的方向的顺应性。改进:在给定的超级家族中进行详细研究,采用减少冗余和富集的方法可以提高其可预测性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号