首页> 外国专利> OPTIMIZED AND HIGH THROUGHPUT COMPARISON AND ANALYTICS OF LARGE SETS OF GENOME DATA

OPTIMIZED AND HIGH THROUGHPUT COMPARISON AND ANALYTICS OF LARGE SETS OF GENOME DATA

机译:大型基因组数据的优化和高通量比较及分析

摘要

A method, computer program product and system for reconciling a plurality of surprisal data sets of a genetic sequence of an organism being generated from a surprisal data reference genome using a base reference genome. If the base reference genome is not the surprisal data reference genome indicated in the surprisal data set, the surprisal data reference genome is retrieved and compared to the base reference genome to obtain reference genome differences. If a starting location of an instance of the surprisal data set is present in the reference genome differences, the nucleotides of the instance of the surprisal data are compared to the nucleotides of the reference genome difference. If the nucleotides of the instance of the surprisal data are the same as the nucleotides of the reference genome difference, the instance of surprisal data is removed from the surprisal data set.
机译:一种方法,计算机程序产品和系统,该方法,计算机程序产品和系统用于使用基础参考基因组协调从意外数据参考基因组产生的生物体的遗传序列的多个意外数据集。如果基本参考基因组不是意外数据集中指示的意外数据参考基因组,则检索意外数据参考基因组并将其与基本参考基因组进行比较以获得参考基因组差异。如果参考基因组差异中存在意外数据集实例的起始位置,则将意外数据实例的核苷酸与参考基因组差异的核苷酸进行比较。如果意外数据实例的核苷酸与参考基因组差异的核苷酸相同,则将意外数据实例从意外数据集中删除。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号