首页> 外国专利> Method and apparatus for the access to bioinformatics data structured in access units

Method and apparatus for the access to bioinformatics data structured in access units

机译:用于访问在访问单元中构造的生物信息学数据的方法和装置

摘要

Method and apparatus for the coding and selective access of compressed genomic sequence data produced by genomic sequencing machines. The coding process is based on aligning sequence reads with respect to pre-existing or constructed reference sequences, on classifying and coding the sequence reads by means of sets of descriptors, and further partitioning the descriptor sets into access units of different types. Efficient selective access to specific genomic regions with the guarantee of retrieving all sequence reads mapped to those regions, is provided by: signaling the type of data mapping configuration used to store or transmit the descriptor sets, determining the minimum number of access units that need to be retrieved and decoded to access a genomic region, providing a master index table that contain all information for optimizing the data access process.
机译:用于编码和选择性访问由基因组测序仪产生的压缩基因组序列数据的方法和装置。编码过程基于相对于预先存在的或构建的参考序列的比对序列读取,基于借助于描述符集对序列读取进行分类和编码,并且进一步将描述符集划分为不同类型的访问单元。通过以下方式提供对特定基因组区域的有效选择性访问,并保证检索到映射到这些区域的所有序列读数:通过信号传递用于存储或传输描述符集的数据映射配置的类型,确定需要访问的最小访问单元数对其进行检索和解码以访问基因组区域,从而提供一个主索引表,其中包含用于优化数据访问过程的所有信息。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号