...
首页> 外文期刊>Bioinformatics >LOESS correction for length variation in gene set-based genomic sequence analysis
【24h】

LOESS correction for length variation in gene set-based genomic sequence analysis

机译:基于基因集的基因组序列分析中长度变化的LOESS校正

获取原文
获取原文并翻译 | 示例
           

摘要

Motivation: Sequence analysis algorithms are often applied to sets of DNA, RNA or protein sequences to identify common or distinguishing features. Controlling for sequence length variation is critical to properly score sequence features and identify true biological signals rather than length-dependent artifacts. Results: Several cis-regulatory module discovery algorithms exhibit a substantial dependence between DNA sequence score and sequence length. Our newly developed LOESS method is flexible in capturing diverse score-length relationships and is more effective in correcting DNA sequence scores for length-dependent artifacts, compared with four other approaches. Application of this method to genes co-expressed during Drosophila melanogaster embryonic mesoderm development or neural development scored by the Lever motif analysis algorithm resulted in successful recovery of their biologically validated cis-regulatory codes. The LOESS length-correction method is broadly applicable, and may be useful not only for more accurate inference of cis-regulatory codes, but also for detection of other types of patterns in biological sequences.
机译:动机:序列分析算法通常应用于DNA,RNA或蛋白质序列集,以识别共有或不同的特征。控制序列长度的变化对于正确地对序列特征进行评分和识别真实的生物信号而不是依赖于长度的伪像至关重要。结果:几种顺式调控模块发现算法在DNA序列得分和序列长度之间表现出很大的依赖性。与其他四种方法相比,我们新开发的LOESS方法可以灵活地捕获各种得分长度关系,并且可以更有效地校正DNA序列得分中与长度相关的伪影。该方法在果蝇胚胎胚中胚层发育或神经发育过程中共表达的基因上的应用(通过杠杆基序分析算法评分)成功地恢复了其生物学验证的顺式调控密码。 LOESS长度校正方法可广泛应用,不仅可用于更准确地推导顺式调节密码,而且可用于检测生物序列中其他类型的模式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号