Bioinformatics (via U.S. National Institutes of Health literature)

Rapid detection of expanded short tandem repeats in personal genomics using hybrid sequencing


Abstract

Motivation: Long expansions of short tandem repeats (STRs), i.e. DNA repeats of 2–6 nt, are associated with some genetic diseases. Cost-efficient high-throughput sequencing can quickly produce billions of short reads that would be useful for uncovering disease-associated STRs. However, enumerating STRs in short reads remains largely unexplored because of the difficulty in elucidating STRs much longer than 100 bp, the typical length of short reads.

Results: We propose ab initio procedures for sensing and locating long STRs promptly by using the frequency distribution of all STRs and paired-end read information. We validated the reproducibility of this method using biological replicates and used it to locate an STR associated with a brain disease (SCA31). Subsequently, we sequenced this STR site in 11 SCA31 samples using SMRT™ sequencing (Pacific Biosciences), determined 2.3–3.1 kb sequences at nucleotide resolution and revealed that (TGGAA)- and (TAAAATAGAA)-repeat expansions determined the instability of the repeat expansions associated with SCA31. Our method could also identify common STRs, (AAAG)- and (AAAAG)-repeat expansions, which are remarkably expanded at four positions in an SCA31 sample. This is the first proposed method for rapidly finding disease-associated long STRs in personal genomes using hybrid sequencing of short and long reads.

Availability and implementation: Our TRhist software is available at .

Contact:

Supplementary information: Supplementary data are available at Bioinformatics online.
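
The Results paragraph describes sensing long STRs from the frequency distribution of all STRs found in short reads. The following is a minimal sketch of that general idea, not the TRhist implementation: it tallies, for each 2–6 nt repeat unit, how many runs of each length occur in a set of reads, then reports units whose runs cover almost an entire read. The function names, the three-copy minimum, and the 90% threshold are assumptions made for illustration. The sketch ignores reverse-complement strands and the paired-end localization step.

```python
import re
from collections import Counter, defaultdict

# A unit of 2-6 nt (shortest tried first) followed by at least two more copies,
# i.e. three or more whole copies in total. The copy minimum is an assumption.
STR_RUN = re.compile(r"([ACGT]{2,6}?)\1{2,}")

def is_primitive(unit):
    """True if unit is not itself a repeat of a shorter string (e.g. 'AA', 'ATAT')."""
    return (unit + unit).find(unit, 1) == len(unit)

def canonical(unit):
    """Smallest rotation, so that 'TGGAA' and 'GGAAT' are counted under one key."""
    return min(unit[i:] + unit[:i] for i in range(len(unit)))

def str_histogram(reads):
    """Map each canonical unit to a Counter of {run length in bp: number of runs}."""
    hist = defaultdict(Counter)
    for read in reads:
        for match in STR_RUN.finditer(read.upper()):
            unit = match.group(1)
            if is_primitive(unit):
                hist[canonical(unit)][len(match.group(0))] += 1
    return hist

def read_spanning_units(hist, read_len, min_fraction=0.9):
    """Count, per unit, the runs covering at least min_fraction of a read."""
    threshold = min_fraction * read_len
    counts = {}
    for unit, runs in hist.items():
        n = sum(c for length, c in runs.items() if length >= threshold)
        if n:
            counts[unit] = n
    return counts
```

Under these assumptions, a unit with an unusually high count of read-spanning runs compared with other samples would be a candidate for a long expansion, since a repeat much longer than the read length yields many reads that consist of the repeat alone.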
