...
首页> 外文期刊>Journal of Bioinformatics and Computational Biology >SUITE OF TOOLS FOR STATISTICAL N-GRAM LANGUAGE MODELING FOR PATTERN MINING IN WHOLE GENOME SEQUENCES
【24h】

SUITE OF TOOLS FOR STATISTICAL N-GRAM LANGUAGE MODELING FOR PATTERN MINING IN WHOLE GENOME SEQUENCES

机译:用于全基因组序列模式挖掘的统计N-G语言建模工具集

获取原文
获取原文并翻译 | 示例

摘要

Genome sequences contain a number of patterns that have biomedical signi¯cance. Repetitivensequences of various kinds are a primary component of most of the genomic sequence patterns.nWe extended the su±x-array based Biological Language Modeling Toolkit to compute n-gramnfrequencies as well as n-gram language-model based perplexity in windows over the wholengenome sequence to ¯nd biologically relevant patterns. We present the suite of tools and theirnapplication for analysis on whole human genome sequence
机译:基因组序列包含许多具有生物医学意义的模式。各种重复序列是大多数基因组序列模式的主要组成部分。n我们扩展了基于su±x数组的Biological Language Modeling Toolkit,以计算整个基因组窗口中n语法频率和n语法语言模型的困惑度。找出生物学上相关的模式。我们提出了用于分析整个人类基因组序列的工具套件及其应用

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号