首页> 美国卫生研究院文献>Scientific Reports >Alignment-free method for DNA sequence clustering using Fuzzy integral similarity
【2h】

Alignment-free method for DNA sequence clustering using Fuzzy integral similarity

机译:基于模糊积分相似度的DNA序列聚类的免比对方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

A larger amount of sequence data in private and public databases produced by next-generation sequencing put new challenges due to limitation associated with the alignment-based method for sequence comparison. So, there is a high need for faster sequence analysis algorithms. In this study, we developed an alignment-free algorithm for faster sequence analysis. The novelty of our approach is the inclusion of fuzzy integral with Markov chain for sequence analysis in the alignment-free model. The method estimate the parameters of a Markov chain by considering the frequencies of occurrence of all possible nucleotide pairs from each DNA sequence. These estimated Markov chain parameters were used to calculate similarity among all pairwise combinations of DNA sequences based on a fuzzy integral algorithm. This matrix is used as an input for the neighbor program in the PHYLIP package for phylogenetic tree construction. Our method was tested on eight benchmark datasets and on in-house generated datasets (18 s rDNA sequences from 11 arbuscular mycorrhizal fungi (AMF) and 16 s rDNA sequences of 40 bacterial isolates from plant interior). The results indicate that the fuzzy integral algorithm is an efficient and feasible alignment-free method for sequence analysis on the genomic scale.
机译:下一代测序产生的私有和公共数据库中的大量序列数据由于与基于比对的序列比较方法相关的局限性而提出了新的挑战。因此,迫切需要更快的序列分析算法。在这项研究中,我们开发了一种无需比对的算法,可进行更快的序列分析。我们的方法的新颖之处在于,在无比对模型中将模糊积分与马尔可夫链一起用于序列分析。该方法通过考虑每个DNA序列中所有可能的核苷酸对的出现频率来估计马尔可夫链的参数。这些估计的马尔可夫链参数用于基于模糊积分算法计算DNA序列的所有成对组合之间的相似性。该矩阵用作PHYLIP软件包中用于系统发育树构建的邻居程序的输入。我们的方法在八个基准数据集和内部生成的数据集上进行了测试(来自11种丛枝菌根真菌(AMF)的18 s rDNA序列和来自植物内部的40种细菌分离株的16 s rDNA序列)。结果表明,模糊积分算法是一种高效可行的无序列比对的基因组序列分析方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号