首页> 外国专利> EFFICIENT CLUSTERING OF NOISY POLYNUCLEOTIDE SEQUENCE READS

EFFICIENT CLUSTERING OF NOISY POLYNUCLEOTIDE SEQUENCE READS

机译:嘈杂的多核苷酸序列读段的有效聚类

摘要

A technique for clustering DNA reads from polynucleotide sequencing is described. DNA reads with a level of difference that is likely caused by errors in sequencing are grouped together in the same cluster. DNA reads that represent reads of different DNA molecules are placed in different clusters. The clusters are based on edit distance, which is the number of changes necessary to convert a given DNA read into another. The process of forming clusters may be performed iteratively and may use other types of distance that serve as an approximation for edit distance. Well clustered DNA reads provide a starting point for further analysis.
机译:描述了一种聚类来自多核苷酸测序的DNA读数的技术。 DNA读数具有一定水平的差异,该差异可能是由测序错误引起的,这些读数在同一簇中分组在一起。代表不同DNA分子读取的DNA读取位于不同的簇中。这些簇基于编辑距离,编辑距离是将给定的DNA读取转换为另一个所需的更改数量。形成簇的过程可以迭代地执行,并且可以使用用作编辑距离的近似值的其他类型的距离。聚集良好的DNA读数为进一步分析提供了起点。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号