首页> 外国专利> METHOD AND SYSTEMS FOR RESTORING GENOMIC REFERENCE SEQUENCES FROM COMPRESSED READINGS OF A GENOMIC SEQUENCE

METHOD AND SYSTEMS FOR RESTORING GENOMIC REFERENCE SEQUENCES FROM COMPRESSED READINGS OF A GENOMIC SEQUENCE

机译:从基因组序列的压缩读数恢复基因组参考序列的方法和系统

摘要

A method and apparatus that includes representing the reference genome by means of syntactic elements describing the differences between the reference genome and aligned genomic sequences. Genomic sequences are pre-aligned with the reference genome. Each aligned genomic sequence is described using a subset of syntactic elements. The syntactic elements describing all genomic sequences are divided into blocks in accordance with their statistical properties. Each block of syntactic elements is entropy encoded. Then the entropy encoded blocks are concatenated to form a compressed binary data stream. Differences between the reference genome and aligned sequences are expressed through syntactic elements. Syntactic elements are divided into blocks in accordance with their statistical properties and each block is entropy encoded. Then, entropy-encoded syntax elements are embedded in a binary data stream of encoded blocks of syntax elements describing aligned reads. The proposed method allows you to restore the reference genome used for alignment when decoding compressed genomic sequences, while maintaining various possibilities for random access to compressed data and providing effective compression.
机译:一种方法和设备,包括通过描述参考基因组和比对的基因组序列之间差异的句法元素来表示参考基因组。基因组序列与参考基因组预先比对。使用语法元素的子集描述每个比对的基因组序列。描述所有基因组序列的句法元素根据其统计特性被分为多个块。语法元素的每个块都经过熵编码。然后,将熵编码的块级联以形成压缩的二进制数据流。参考基因组和比对序列之间的差异通过句法元件表达。语法元素根据其统计属性分为多个块,并且每个块都经过熵编码。然后,将熵编码的语法元素嵌入描述了对齐读取的语法元素的编码块的二进制数据流中。提出的方法可让您在解码压缩的基因组序列时恢复用于比对的参考基因组,同时保留各种随机访问压缩数据并提供有效压缩的可能性。

著录项

  • 公开/公告号EA201991906A1

    专利类型

  • 公开/公告日2020-01-21

    原文格式PDF

  • 申请/专利权人 ГЕНОМСЫС СА;

    申请/专利号EA20190091906

  • 发明设计人 КЛАУДИО;МОХАМЕД ХОСО;

    申请日2017-12-14

  • 分类号C40B50/02;G06F19/26;G06F19/22;

  • 国家 EA

  • 入库时间 2022-08-21 11:17:36

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号