首页> 美国卫生研究院文献>Genes >Prediction of Long Non-Coding RNAs Based on Deep Learning
【2h】

Prediction of Long Non-Coding RNAs Based on Deep Learning

机译:基于深度学习的长非编码RNA预测

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

With the rapid development of high-throughput sequencing technology, a large number of transcript sequences have been discovered, and how to identify long non-coding RNAs (lncRNAs) from transcripts is a challenging task. The identification and inclusion of lncRNAs not only can more clearly help us to understand life activities themselves, but can also help humans further explore and study the disease at the molecular level. At present, the detection of lncRNAs mainly includes two forms of calculation and experiment. Due to the limitations of bio sequencing technology and ineluctable errors in sequencing processes, the detection effect of these methods is not very satisfactory. In this paper, we constructed a deep-learning model to effectively distinguish lncRNAs from mRNAs. We used k-mer embedding vectors obtained through training the GloVe algorithm as input features and set up the deep learning framework to include a bidirectional long short-term memory model (BLSTM) layer and a convolutional neural network (CNN) layer with three additional hidden layers. By testing our model, we have found that it obtained the best values of 97.9%, 96.4% and 99.0% in F1score, accuracy and auROC, respectively, which showed better classification performance than the traditional PLEK, CNCI and CPC methods for identifying lncRNAs. We hope that our model will provide effective help in distinguishing mature mRNAs from lncRNAs, and become a potential tool to help humans understand and detect the diseases associated with lncRNAs.
机译:随着高通量测序技术的飞速发展,已经发现了大量的转录物序列,如何从转录物中鉴定长的非编码RNA(lncRNA)是一项艰巨的任务。 lncRNA的鉴定和纳入不仅可以更清楚地帮助我们了解自身的生命活动,还可以帮助人类在分子水平上进一步探索和研究该疾病。目前,lncRNA的检测主要包括计算和实验两种形式。由于生物测序技术的局限性以及测序过程中不可避免的错误,这些方法的检测效果不是很令人满意。在本文中,我们构建了一个深度学习模型来有效区分lncRNA和mRNA。我们使用通过训练GloVe算法获得的k-mer嵌入向量作为输入特征,并建立了深度学习框架以包括双向长短期记忆模型(BLSTM)层和卷积神经网络(CNN)层,其中还有三个隐藏的层层。通过测试我们的模型,我们发现它在F1score,准确性和auROC中分别获得了97.9%,96.4%和99.0%的最佳值,与用于识别lncRNA的传统PLEK,CNCI和CPC方法相比,分类性能更好。我们希望我们的模型将为区分lncRNAs中的成熟mRNA提供有效的帮助,并成为帮助人们了解和检测与lncRNAs相关的疾病的潜在工具。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号