首页> 外国专利> Large data set negative information storage model

Large data set negative information storage model

机译：大数据集负信息存储模型

页面导航

摘要
著录项
相似文献

摘要

Systems and methods for storing large data sets, such as genetic sequence information. Within a “targeted subset” of positions with information, the system stores, both variant states and missing states at each position. Reference states are not stored, but are inferred within the targeted subset when neither a variant nor a missing state is stored at a given position. The absence of a variant state at a given position is assumed to be a reference state. The criteria for missing data are defined in pre-processing and are customizable based on the use case. For example, each data point may represent the genetic information of a sample at a position in the genome. The targeted subset may represent those positions that were included in a sequencing test.

机译：用于存储大数据集的系统和方法，例如遗传序列信息。在具有信息的位置的“目标子集”中，系统存储，每个位置的变体状态和丢失状态。不存储参考状态，但是当既没有变体也没有丢失状态时，在目标子集中被存储在给定位置。假设在给定位置处不存在变型状态是参考状态。缺失数据的标准在预处理中定义，并根据用例自定义。例如，每个数据点可以代表基因组中的位置处的样品的遗传信息。目标子集可以表示在测序测试中包括的那些位置。

著录项

公开/公告号US11216442B2

专利类型
公开/公告日2022-01-04

原文格式PDF
申请/专利权人 H. LEE MOFFITT CANCER CENTER & RESEARCH INSTITUTE INC.;
展开▼

申请/专利号US201615751955
发明设计人 JAMIE K. TEER;RUIZHENG LIU;GUILLERMO GONZALEZ-CALDERON;RODRIGO CARVAJAL-PELAEZ;
展开▼

申请日2016-08-12
分类号G06F16/215;G06F16/23;G06F16/22;G16B50;G16B30;G16B50/50;G16B30/10;
国家 US
入库时间 2024-06-14 22:38:26

相似文献

专利
外文文献
中文文献