首页> 外国专利> Large data set negative information storage model

Large data set negative information storage model

机译:大数据集负信息存储模型

摘要

Systems and methods for storing large data sets, such as genetic sequence information. Within a “targeted subset” of positions with information, the system stores, both variant states and missing states at each position. Reference states are not stored, but are inferred within the targeted subset when neither a variant nor a missing state is stored at a given position. The absence of a variant state at a given position is assumed to be a reference state. The criteria for missing data are defined in pre-processing and are customizable based on the use case. For example, each data point may represent the genetic information of a sample at a position in the genome. The targeted subset may represent those positions that were included in a sequencing test.
机译:用于存储大数据集的系统和方法,例如遗传序列信息。 在具有信息的位置的“目标子集”中,系统存储,每个位置的变体状态和丢失状态。 不存储参考状态,但是当既没有变体也没有丢失状态时,在目标子集中被存储在给定位置。 假设在给定位置处不存在变型状态是参考状态。 缺失数据的标准在预处理中定义,并根据用例自定义。 例如,每个数据点可以代表基因组中的位置处的样品的遗传信息。 目标子集可以表示在测序测试中包括的那些位置。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号