首页> 美国卫生研究院文献>other >Union Exon Based Approach for RNA-Seq Gene Quantification: To Be or Not to Be?
【2h】

Union Exon Based Approach for RNA-Seq Gene Quantification: To Be or Not to Be?

机译:基于联合外显子的RNA-Seq基因定量方法:是还是不是?

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

In recent years, RNA-seq is emerging as a powerful technology in estimation of gene and/or transcript expression, and RPKM (Reads Per Kilobase per Million reads) is widely used to represent the relative abundance of mRNAs for a gene. In general, the methods for gene quantification can be largely divided into two categories: transcript-based approach and ‘union exon’-based approach. Transcript-based approach is intrinsically more difficult because different isoforms of the gene typically have a high proportion of genomic overlap. On the other hand, ‘union exon’-based approach method is much simpler and thus widely used in RNA-seq gene quantification. Biologically, a gene is expressed in one or more transcript isoforms. Therefore, transcript-based approach is logistically more meaningful than ‘union exon’-based approach. Despite the fact that gene quantification is a fundamental task in most RNA-seq studies, however, it remains unclear whether ‘union exon’-based approach for RNA-seq gene quantification is a good practice or not. In this paper, we carried out a side-by-side comparison of ‘union exon’-based approach and transcript-based method in RNA-seq gene quantification. It was found that the gene expression levels are significantly underestimated by ‘union exon’-based approach, and the average of RPKM from ‘union exons’-based method is less than 50% of the mean expression obtained from transcript-based approach. The difference between the two approaches is primarily affected by the number of transcripts in a gene. We performed differential analysis at both gene and transcript levels, respectively, and found more insights, such as isoform switches, are gained from isoform differential analysis. The accuracy of isoform quantification would improve if the read coverage pattern and exon-exon spanning reads are taken into account and incorporated into EM (Expectation Maximization) algorithm. Our investigation discourages the use of ‘union exons’-based approach in gene quantification despite its simplicity.
机译:近年来,RNA-seq成为一种强大的技术,可用于估计基因和/或转录本的表达,并且RPKM(每千碱基的读数/百万读数)被广泛用于代表基因的mRNA相对丰度。通常,用于基因定量的方法大致可分为两类:基于转录本的方法和基于“联合外显子”的方法。基于转录本的方法本质上更加困难,因为基因的不同同工型通常具有很高比例的基因组重叠。另一方面,基于“联合外显子”的方法要简单得多,因此广泛用于RNA-seq基因定量。在生物学上,基因以一种或多种转录亚型表达。因此,基于成绩单的方法在逻辑上比基于“工会外显子”的方法更有意义。尽管在大多数RNA-seq研究中基因定量是一项基本任务,但是,尚不清楚基于“工会外显子”的RNA-seq基因定量方法是否是一种好习惯。在本文中,我们对基于“工会外显子”的方法和基于转录本的方法在RNA-seq基因定量中进行了并排比较。结果发现,基于“联合外显子”的方法显着低估了基因表达水平,基于“联合外显子”的方法的RPKM平均值小于基于转录本方法获得的平均表达的50%。两种方法之间的差异主要受基因中转录本数量的影响。我们分别在基因和转录本水平上进行了差异分析,发现从同工型差异分析中获得了更多的见解,例如同工型开关。如果将阅读覆盖率模式和外显子-外显子跨度读数考虑在内并整合到EM(期望最大化)算法中,同工型定量分析的准确性将会提高。尽管其简单性,我们的研究仍不鼓励在基因定量中使用基于“联合外显子”的方法。

著录项

  • 期刊名称 other
  • 作者单位
  • 年(卷),期 -1(10),11
  • 年度 -1
  • 页码 e0141910
  • 总页数 21
  • 原文格式 PDF
  • 正文语种
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号