首页> 外文会议>Asia-Pacific Bioinformatics Conference >REMAS: a new regression model to identify alternative splicing events from exon array data
【24h】

REMAS: a new regression model to identify alternative splicing events from exon array data

机译:Remas:一种新的回归模型,用于识别外显子阵列数据的替代拼接事件

获取原文

摘要

Background: Alternative splicing (AS) is an important regulatory mechanism for gene expression and protein diversity in eukaryotes. Previous studies have demonstrated that it can be causative for, or specific to splicing-related diseases. Understanding the regulation of AS will be helpful for diagnostic efforts and drug discoveries on those splicing-related diseases. As a novel exon-centric microarray platform, exon array enables a comprehensive analysis of AS by investigating the expression of knownand predicted exons. Identifying of AS events from exon array has raised much attention, however, new and powerful algorithms for exon array data analysis are still absent till now.Results: Here, we considered identifying of AS events in the framework of variable selection and developed a regression method for AS detection (REMAS). Firstly, features of alternatively spliced exons were scaled by reasonably defined variables. Secondly, we designed a hierarchical model which can represent gene structure and transcriptional influence to exons, and the lasso type penalties were introduced in calculation because of huge variable size. Thirdly, an iterative two-step algorithm was developed to select alternatively spliced genes and exons. To avoid negative effects introduced by small sample size, we ranked genes as parameters indicating their AS capabilities in an iterative manner. After that, both simulation and real data evaluation showed that REMAS could efficiently identify potential AS events, some of which had been validated by RT-PCR or supported by literature evidence.Conclusion: As a new lasso regression algorithm based on hierarchical model, REMAS has been demonstrated as a reliable and effective method to identify AS events from exon array data.
机译:背景:替代剪接(AS)是真核生物中基因表达和蛋白质多样性的重要调节机制。以前的研究表明,它可能是致命的或特异性的剪接相关的疾病。理解对剪接相关疾病的诊断努力和药物发现有所帮助。作为一种以新颖的外显子性的微阵列平台,外显子阵列能够综合分析,以研究知名和预测的外显子的表达。识别出来自外显子阵列的事件提出了很大的关注,然而,直到现在仍然存在新的和强大的外显子阵列数据分析算法。结果:这里,我们考虑在变量选择框架中识别作为事件,并开发了回归方法作为检测(REMAS)。首先,通过合理定义的变量来缩放可拼接外显子的特征。其次,我们设计了一种分层模型,可以代表基因结构和对外显子的转录影响,并且由于巨大的可变大小,在计算中引入了套索类型的惩罚。第三,开发了一种迭代两步算法以选择可选的剪接基因和外显子。为避免小样本大小引入的负面影响,我们将基因作为参数以迭代方式指示其作为能力的参数。之后,仿真和实际数据评估显示,雷帕斯可以有效地识别潜在的事件,其中一些已经通过RT-PCR验证或通过文献证据支持。结论:作为基于分层模型的新的套索回归算法,REMAS被证明是一种可靠且有效的方法,以从外显子阵列数据识别为事件。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号