Source: BMC Genomics

Inferential considerations for low-count RNA-seq transcripts: a case study on the dominant prairie grass Andropogon gerardii

Abstract

Background: Differential expression (DE) analysis of RNA-seq data still poses inferential challenges, such as the handling of transcripts characterized by low expression levels. In this study, we use a plasmode-based approach to assess the relative performance of alternative inferential strategies on RNA-seq transcripts, with special emphasis on transcripts characterized by a small number of read counts, so-called low-count transcripts, as motivated by an ecological application in prairie grasses. Big bluestem (Andropogon gerardii) is a wide-ranging dominant prairie grass of ecological and agricultural importance to the US Midwest, while the edaphic subspecies sand bluestem (A. gerardii ssp. hallii) grows exclusively on sand dunes. Relative to big bluestem, sand bluestem exhibits qualitative phenotypic divergence consistent with enhanced drought tolerance, plausibly associated with transcripts of low expression levels. Our dataset consists of RNA-seq read counts for 25,582 transcripts (60% of which are classified as low-count) collected from leaf tissue of individual plants of big bluestem (n = 4) and sand bluestem (n = 4). Focusing on low-count transcripts, we compare alternative ad hoc data filtering techniques commonly used in RNA-seq pipelines and assess the inferential performance of recently developed statistical methods for DE analysis, namely DESeq2 and edgeR robust. These methods attempt to overcome the inherently noisy behavior of low-count transcripts by shrinkage (DESeq2) or by differential weighting of observations (edgeR robust).
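As a rough illustration of what an ad hoc low-count filter in an RNA-seq pipeline might look like, the sketch below splits transcripts by their mean read count across samples. The function name `filter_low_count`, the threshold `min_mean=10.0`, and the toy counts are all assumptions made for illustration; the abstract does not state which filtering rules or cutoffs the authors compared, nor how they define "low-count".

```python
from statistics import mean


def filter_low_count(counts, min_mean=10.0):
    """Split transcripts into kept and low-count groups by mean read count.

    counts:   dict mapping transcript id -> list of per-sample read counts.
    min_mean: hypothetical cutoff; the abstract does not specify one.
    """
    kept, low = {}, {}
    for transcript_id, row in counts.items():
        # A transcript is retained only if its mean count reaches the cutoff.
        target = kept if mean(row) >= min_mean else low
        target[transcript_id] = row
    return kept, low


# Toy data (invented): 8 samples per transcript, mirroring the 4 + 4 design.
toy_counts = {
    "t1": [120, 98, 110, 105, 300, 280, 310, 295],
    "t2": [0, 2, 1, 0, 3, 5, 4, 2],
    "t3": [9, 11, 10, 12, 8, 10, 9, 11],
}

kept, low = filter_low_count(toy_counts)
print("kept:", sorted(kept))
print("low-count:", sorted(low))
```

A filter like this discards low-count transcripts before testing, whereas the methods named in the abstract (DESeq2 and edgeR robust) keep them and instead temper their noise through shrinkage or observation weights.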
