首页> 外国专利> method and systems for efficient compression of genomic sequence readings

method and systems for efficient compression of genomic sequence readings

机译:有效压缩基因组序列读数的方法和系统

摘要

a method and apparatus for compressing genome sequence data produced by genome sequencing machines are disclosed. sequence readings are encoded by aligning them with pre-existing or built reference sequences, the coding process consists of a classification of readings into classes of data followed by the coding of each class in terms of a multiplicity of genomic descriptors . genomic descriptors of the same type are organized in blocks that are compressed by applying successive steps of transformation, binarization and entropy coding. specific source models and entropy encoders are used for each class of data and for each associated descriptor.
机译:公开了一种用于压缩由基因组测序仪产生的基因组序列数据的方法和装置。序列读数是通过将它们与预先存在或构建的参考序列进行比对来编码的,编码过程包括将读数分类为数据类别,然后根据多个基因组描述符对每个类别进行编码。相同类型的基因组描述符以块形式组织,通过应用连续的变换,二值化和熵编码步骤将其压缩。特定的源模型和熵编码器用于每类数据和每个关联的描述符。

著录项

  • 公开/公告号BR112019016232A2

    专利类型

  • 公开/公告日2020-04-07

    原文格式PDF

  • 申请/专利权人 GENOMSYS SA;

    申请/专利号BR20191116232

  • 发明设计人 CLAUDIO ALBERTI;MOHAMED KHOSO BALUCH;

    申请日2017-12-15

  • 分类号G16B45;

  • 国家 BR

  • 入库时间 2022-08-21 11:18:30

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号