首页> 外国专利> METHOD AND SYSTEMS FOR THE EFFICIENT COMPRESSION OF GENOMIC SEQUENCE READS

METHOD AND SYSTEMS FOR THE EFFICIENT COMPRESSION OF GENOMIC SEQUENCE READS

机译:有效压缩基因组序列读段的方法和系统

摘要

Method and apparatus for the compression of genome sequence data produced by genome sequencing machines. Sequence reads are coded by aligning them with respect to pre-existing or constructed reference sequences, the coding process is composed of a classification of the reads into data classes followed by the coding of each class in terms of a multiplicity of genomic descriptors. Genomic descriptors of the same type are organized in blocks which are compressed by applying successive transformation stages, binarization and entropy coding. Specific source models and entropy coders are used for each data class and for each associated descriptor.
机译:用于压缩由基因组测序仪产生的基因组序列数据的方法和装置。序列读段通过将它们相对于预先存在的或构建的参考序列进行比对来编码,编码过程包括将读段分类为数据类,然后根据多个基因组描述符对每个类进行编码。相同类型的基因组描述符以块形式组织,通过应用连续的转换阶段,二值化和熵编码来压缩它们。特定的源模型和熵编码器用于每个数据类和每个关联的描述符。

著录项

  • 公开/公告号US2020051667A1

    专利类型

  • 公开/公告日2020-02-13

    原文格式PDF

  • 申请/专利权人 GENOMSYS SA;

    申请/专利号US201716485649

  • 发明设计人 CLAUDIO ALBERTI;MOHAMED KHOSO BALUCH;

    申请日2017-12-15

  • 分类号G16B50/40;G16B45;H03M7/30;G16B50/30;G16B20/20;G16B20/10;G16B30/10;G16B30/20;G16B40/10;G16B50/50;

  • 国家 US

  • 入库时间 2022-08-21 11:23:48

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号