Journal: Bioinformatics
TEclass--a tool for automated classification of unknown eukaryotic transposable elements


Abstract

MOTIVATION: The large number of sequenced genomes has required the development of software that reconstructs the consensus sequences of transposons and other repetitive elements. However, the available tools usually focus on the accurate identification of raw repeats and provide no information about the taxonomic position of the reconstructed consensus sequences. TEclass is a tool that classifies unknown transposable elements into four main functional categories, which reflect their mode of transposition: DNA transposons, long terminal repeats (LTRs), long interspersed nuclear elements (LINEs) and short interspersed nuclear elements (SINEs). TEclass uses a machine learning method, the support vector machine (SVM), to classify sequences based on their oligomer frequencies. It achieves 90-97% accuracy in the classification of novel DNA and LTR repeats, and 75% for LINEs and SINEs. AVAILABILITY: http://www.compgen.uni-muenster.de/teclass; a stand-alone program is available upon request.
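
As a rough illustration of what "oligomer frequencies" means as a classifier input, the sketch below turns a DNA sequence into a fixed-length vector of relative k-mer frequencies. It is not TEclass code: the abstract does not state the oligomer length, how ambiguous bases are treated, or how the SVM is configured, so the function name `oligomer_frequencies`, the default `k=4`, and the choice to skip windows containing non-ACGT characters are all assumptions made for this example.

```python
from collections import Counter
from itertools import product


def oligomer_frequencies(seq, k=4):
    """Return relative frequencies of all 4**k DNA k-mers in `seq`.

    Hypothetical helper for illustration; k=4 is an assumed default,
    not a value taken from the abstract.
    The vector is ordered lexicographically (AAAA, AAAC, ..., TTTT for k=4).
    """
    seq = seq.upper()
    # Every possible k-mer over the DNA alphabet, in a fixed order.
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    # Count each overlapping window of length k.
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    # Assumption: windows containing N or other symbols are ignored.
    total = sum(counts[m] for m in kmers)
    if total == 0:
        return [0.0] * len(kmers)
    return [counts[m] / total for m in kmers]


if __name__ == "__main__":
    vec = oligomer_frequencies("ACGTACGTTTGACA", k=2)
    print(len(vec), round(sum(vec), 6))
```

Vectors of this kind, one per labelled repeat sequence, are the sort of numeric features an SVM can be trained on; the training step itself is omitted here because the abstract gives no details about it.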
