首页> 外国专利> Methods and systems for predicting DNA accessibility in the pan-cancer genome

Methods and systems for predicting DNA accessibility in the pan-cancer genome

机译:预测全癌基因组中DNA可达性的方法和系统

摘要

Techniques are provided for predicting DNA accessibility. DNase-seq data files and RNA-seq data files for a plurality of cell types are paired by assigning DNase-seq data files to RNA-seq data files that are at least within a same biotype. A neural network is configured to be trained using batches of the paired data files, where configuring the neural network comprises configuring convolutional layers to process a first input comprising DNA sequence data from a paired data file to generate a convolved output, and fully connected layers following the convolutional layers to concatenate the convolved output with a second input comprising gene expression levels derived from RNA-seq data from the paired data file and process the concatenation to generate a DNA accessibility prediction output. The trained neural network is used to predict DNA accessibility in a genomic sample input comprising RNA-seq data and whole genome sequencing for a new cell type.
机译:提供了预测DNA可及性的技术。通过将DNase-seq数据文件分配给至少在同一生物类型内的RNA-seq数据文件,将多种细胞类型的DNase-seq数据文件和RNA-seq数据文件配对。将神经网络配置为使用成批的配对数据文件进行训练,其中配置神经网络包括配置卷积层以处理包含来自配对数据文件的DNA序列数据的第一输入以生成卷积输出,以及随后的全连接层卷积层将卷积输出与第二输入连接起来,第二输入包含来自配对数据文件中RNA-seq数据的基因表达水平,并处理该连接以生成DNA可及性预测输出。训练有素的神经网络用于预测基因组样本输入中的DNA可达性,其中包括RNA-seq数据和新细胞类型的全基因组测序。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号