首页> 外文期刊>Journal of ICT Research and Applications >Optimization of Spaced K-mer Frequency Feature Extraction using Genetic Algorithms for Metagenome Fragment Classification
【24h】

Optimization of Spaced K-mer Frequency Feature Extraction using Genetic Algorithms for Metagenome Fragment Classification

机译:利用遗传算法优化基因组片段分类的间隔K-mer频率特征提取

获取原文
获取外文期刊封面目录资料

摘要

K -mer frequencies are commonly used in extracting features from metagenome fragments. In spite of this, researchers have found that their use is still inefficient. In this research, a genetic algorithm was employed to find optimally spaced k -mers. These were obtained by generating the possible combinations of match positions and don’t care positions (written as *). This approach was adopted from the concept of spaced seeds in PatternHunter. The use of spaced k -mers could reduce the size of the k -mer frequency feature’s dimension. To measure the accuracy of the proposed method we used the na?ve Bayesian classifier (NBC). The result showed that the chromosome 111111110001, representing spaced k -mer model [111 1111 10001], was the best chromosome, with a higher fitness (85.42) than that of the k -mer frequency feature. Moreover, the proposed approach also reduced the feature extraction time.?
机译:K-mer频率通常用于从元基因组片段中提取特征。尽管如此,研究人员发现他们的使用仍然效率低下。在这项研究中,采用遗传算法来找到最佳间隔的k-mers。这些是通过生成匹配位置和无关位(可能为*)的可能组合而获得的。此方法是从PatternHunter中的间隔种子概念中采用的。使用间隔开的k像素可以减小k像素频率特征尺寸的大小。为了测量所提出方法的准确性,我们使用了朴素贝叶斯分类器(NBC)。结果表明,代表间隔k-mer模型[111 1111 10001]的染色体111111110001是最好的染色体,其适应度(85.42)比k -mer频率特征高。此外,所提出的方法还减少了特征提取时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号