...
首页> 外文期刊>International journal of data mining and bioinformatics >A fast and novel approach based on grouping and weighted mRMR for feature selection and classification of protein sequence data
【24h】

A fast and novel approach based on grouping and weighted mRMR for feature selection and classification of protein sequence data

机译:基于分组和加权MRMR的快速和新方法,用于蛋白质序列数据的特征选择和分类

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

The analysis of protein sequences under bioinformatics has gained wide importance in research area. Newly added protein sequences can be analysed using existing proteins and converting them into feature vector form. However, it emerges as a challenging task to deal with huge number of features obtained using sequence encoding techniques. Since all the features obtained are not actually required, a three-stage feature selection approach has been proposed. In the first stage, features are ranked and most irrelevant features are removed; in the second stage, conflicting features are grouped together; and in third stage, a fast approach based on weighted Minimum Redundancy Maximum Relevance (wMRMR) has been proposed and applied on grouped features. Different classification methods are used to analyse the performance of the proposed approach. It is observed that the proposed approach has increased classification accuracy results and reduced time consumption in comparison to the state-of-the-art methods.
机译:生物信息学下蛋白质序列的分析在研究领域具有很重要的。可以使用现有蛋白分析新增的蛋白质序列并将它们转化为特征载体形式。但是,它作为一个具有挑战性的任务,可以处理使用序列编码技术获得的大量功能。由于实际上没有所获得的所有特征,因此提出了一种三阶段特征选择方法。在第一阶段,特征是排名的,并且删除了最无关的功能;在第二阶段,相互冲突的功能将分组;在第三阶段,已经提出了一种基于加权最小冗余最大相关性(WMRMR)的快速方法并应用于分组的特征。不同的分类方法用于分析所提出的方法的性能。观察到,与最先进的方法相比,所提出的方法增加了分类准确度结果和减少时间消耗。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号