首页> 外国专利> Methods and systems for predicting DNA accessibility in pan-cancer genomes

Methods and systems for predicting DNA accessibility in pan-cancer genomes

机译:预测全癌基因组DNA可达性的方法和系统

摘要

Techniques for predicting DNA accessibility are provided. A DNase-seq data file and an RNA-seq data file of a plurality of cell types are paired by assigning the DNase-seq data file to at least an RNA-seq data file within the same biotype. The neural network is configured to be trained using a batch of paired data files, wherein configuring the neural network comprises converting the convolutional layer to a first input containing DNA sequence data from the paired data files. To generate a convolution output, wherein the fully connected layer following the convolution layer includes a gene expression level derived from RNA-seq data derived from the pair of data files. Concatenating with two inputs and processing the concatenation to generate a predicted DNA accessibility output. Trained neural networks are used to predict DNA accessibility in genomic sample inputs, including RNA-seq data for novel cell types and whole genome sequencing. [Selection diagram] Fig. 1
机译:提供了预测DNA可及性的技术。通过将DNase-seq数据文件分配给同一生物型内的至少一个RNA-seq数据文件来配对多种细胞类型的DNase-seq数据文件和RNA-seq数据文件。将神经网络配置为使用一批成对的数据文件进行训练,其中配置神经网络包括将卷积层转换为包含来自成对数据文件的DNA序列数据的第一输入。为了生成卷积输出,其中,在卷积层之后的完全连接层包括从RNA-seq数据衍生的基因表达水平,该RNA-seq数据衍生自该对数据文件。与两个输入串联并处理该串联以生成预测的DNA可达性输出。经过训练的神经网络可用于预测基因组样本输入中的DNA可达性,包括用于新型细胞类型和全基因组测序的RNA序列数据。 [选择图]图1

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号