@@The newly-developed metagenomic approach which obtains shotgun DNA sequences directly from environments is rapidly becoming a powerful approach for studying the microorganisms in natural niches and microbiomes resided in and on human bodies. As many sequences remain as unassembled one-pass reads, most conventional gene-finding tools are failure to identify genes in metagenomic samples. Although similarity-based method is widely used currently, it's also questioned for its disability in finding novel genes. Up to now, several algorithms were desired for ab initio metagenomic gene identification. MetaGene and MetaGeneMark construct regressive models between frequencies of oligonucleotides in protein-coding regions and genomic nucleotide composition, while Orphelia uses a large scale machine-learning method with a combination of neural networks and linear discriminates. Here, we present a novel method, MetaGun, which comprises a supervised universal model and a data-specific novel model based on SVM architectre.
展开▼
机译:@@直接从环境中获得霰弹枪DNA序列的新开发的聚蛋白方法正在迅速成为研究自然核桃和微生物体中的微生物的强大方法。随着许多序列仍然是未组装的单通读数,大多数常规的基因查找工具都不识别偏见样品中的基因。虽然目前广泛使用了基于相似的方法,但它还质疑其在寻找新基因方面的残疾。到目前为止,需要几种算法对于AB Initio Metagenomic基因鉴定。 Metagene和Metagenemark在蛋白质编码区和基因组核苷酸组合物中寡核苷酸频率之间的回归模型,而蒙儿生使用具有神经网络的组合和线性判别的大规模机器学习方法。在这里,我们提出了一种新的方法Metagun,其包括基于SVM architectrectre的监督通用模型和数据特定的新型模型。
展开▼