首页> 外国专利> OPTIMIZED AND HIGH THROUGHPUT COMPARISON AND ANALYTICS OF LARGE SETS OF GENOME DATA

OPTIMIZED AND HIGH THROUGHPUT COMPARISON AND ANALYTICS OF LARGE SETS OF GENOME DATA

机译：大型基因组数据的优化和高通量比较及分析

页面导航

摘要
著录项
相似文献

摘要

A method, computer program product and system for reconciling a plurality of surprisal data sets of a genetic sequence of an organism being generated from a surprisal data reference genome using a base reference genome. If the base reference genome is not the surprisal data reference genome indicated in the surprisal data set, the surprisal data reference genome is retrieved and compared to the base reference genome to obtain reference genome differences. If a starting location of an instance of the surprisal data set is present in the reference genome differences, the nucleotides of the instance of the surprisal data are compared to the nucleotides of the reference genome difference. If the nucleotides of the instance of the surprisal data are the same as the nucleotides of the reference genome difference, the instance of surprisal data is removed from the surprisal data set.

机译：一种方法，计算机程序产品和系统，该方法，计算机程序产品和系统用于使用基础参考基因组协调从意外数据参考基因组产生的生物体的遗传序列的多个意外数据集。如果基本参考基因组不是意外数据集中指示的意外数据参考基因组，则检索意外数据参考基因组并将其与基本参考基因组进行比较以获得参考基因组差异。如果参考基因组差异中存在意外数据集实例的起始位置，则将意外数据实例的核苷酸与参考基因组差异的核苷酸进行比较。如果意外数据实例的核苷酸与参考基因组差异的核苷酸相同，则将意外数据实例从意外数据集中删除。

著录项

公开/公告号US2014310214A1

专利类型
公开/公告日2014-10-16

原文格式PDF
申请/专利权人 INTERNATIONAL BUSINESS MACHINES CORPORATION;
展开▼

申请/专利号US201313861607
发明设计人 JAMES R. KRAEMER;JOSKO SILOBRCIC;ROBERT R. FRIEDLANDER;
展开▼

申请日2013-04-12
分类号G06N3/12;
国家 US
入库时间 2022-08-21 16:09:42

相似文献

专利
外文文献
中文文献