首页> 外国专利> EFFICIENT CLUSTERING OF NOISY POLYNUCLEOTIDE SEQUENCE READS

EFFICIENT CLUSTERING OF NOISY POLYNUCLEOTIDE SEQUENCE READS

机译:有效聚类的嘈杂多核苷酸序列读取

摘要

A technique for clustering DNA reads from polynucleotide sequencing is described. DNA reads with a level of difference that is likely caused by errors in sequencing are grouped together in the same cluster. DNA reads that represent reads of different DNA molecules are placed in different clusters. The clusters are based on edit distance, which is the number of changes necessary to convert a given DNA read into another. The process of forming clusters may be performed iteratively and may use other types of distance that serve as an approximation for edit distance. Well clustered DNA reads provide a starting point for further analysis.
机译:描述了一种用于聚类DNA从多核苷酸测序读取的技术。 DNA以差异的差异读取,该差异可能由测序中的误差引起在同一群集中一起分组。代表不同DNA分子读数的DNA读入不同的簇。群集基于编辑距离,这是将给定DNA读入另一个的给定DNA所需的变化次数。形成簇的过程可以迭代地执行,并且可以使用用作编辑距离的近似的其他类型的距离。良好的聚类DNA读取提供了进一步分析的起点。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号