首页> 外国专利> LARGE DATA SET NEGATIVE INFORMATION STORAGE MODEL

LARGE DATA SET NEGATIVE INFORMATION STORAGE MODEL

机译：大数据集负信息存储模型

页面导航

摘要
著录项
相似文献

摘要

Systems and methods for storing large data sets, such as genetic sequence information. Within a "targeted subset" of positions with information, the system stores, both variant states and missing states at each position. Reference states are not stored, but are inferred within the targeted subset when neither a variant nor a missing state is stored at a given position. The absence of a variant state at a given position is assumed to be a reference state. The criteria for missing data are defined in pre-processing and are customizable based on the use case. For example, each data point may represent the genetic information of a sample at a position in the genome. The targeted subset may represent those positions that were included in a sequencing test.

机译：用于存储大数据集（例如遗传序列信息）的系统和方法。在具有信息的位置的“目标子集”内，系统在每个位置存储变体状态和缺失状态。参考状态不会存储，但是当变体或缺失状态都没有存储在给定位置时，可以在目标子集中推断参考状态。在给定位置不存在变化状态被认为是参考状态。丢失数据的标准在预处理中定义，并且可以根据用例进行自定义。例如，每个数据点可以代表基因组中某个位置的样品的遗传信息。目标子集可以代表测序测试中包括的那些位置。

著录项

公开/公告号WO2017025935A1

专利类型
公开/公告日2017-02-16

原文格式PDF
申请/专利权人 H. LEE MOFFITT CANCER CENTER & RESEARCH INSTITUTE;
展开▼

申请/专利号WO2016IB54868
发明设计人 LIU RUIZHENG;GONZALEZ-CALDERON GUILLERMO;CARVAJAL RODRIGO;TEER JAMIE KRISTOPHER;
展开▼

申请日2016-08-12
分类号G06F17/30;G06F17/40;G06F19/22;
国家 WO
入库时间 2022-08-21 13:32:11

相似文献

专利
外文文献
中文文献