首页> 外文学位 >Machine Learning for Exploring State Space Structure in Genetic Regulatory Networks
【24h】

Machine Learning for Exploring State Space Structure in Genetic Regulatory Networks

机译:机器学习在遗传监管网络中探索状态空间结构

获取原文
获取原文并翻译 | 示例

摘要

Genetic regulatory networks (GRN) offer a useful model for clinical biology. Specifically, such networks capture interactions among genes, proteins, and other metabolic factors. Unfortunately, it is difficult to understand and predict the behavior of networks that are of realistic size and complexity. In this dissertation, behavior refers to the trajectory of a state, through a series of state transitions over time, to an attractor in the network. This project assumes asynchronous Boolean networks, implying that a state may transition to more than one attractor. The goal of this project is to efficiently identify a network's set of attractors and to predict the likelihood with which an arbitrary state leads to each of the network's attractors. These probabilities will be represented using a fuzzy membership vector.;Predicting fuzzy membership vectors using machine learning techniques may address the intractability posed by networks of realistic size and complexity. Modeling and simulation can be used to provide the necessary training sets for machine learning methods to predict fuzzy membership vectors. The experiments comprise several GRNs, each represented by a set of output classes. These classes consist of thresholds tau and ¬tau, where tau = [tau low, tauhigh]; state s belongs to class tau if the probability of its transitioning to attractor A belongs to the range [taulow, tau high]; otherwise it belongs to class ¬tau. Finally, each machine learning classifier was trained with the training sets that was previously collected. The objective is to explore methods to discover patterns for meaningful classification of states in realistically complex regulatory networks.;The research design took a GRN and a machine learning method as input and produced output class and its negation ¬. For each GRN, attractors were identified, data was collected by sampling each state to create fuzzy membership vectors, and machine learning methods were trained to predict whether a state is in a healthy attractor or not. For T-LGL, SVMs had the highest accuracy in predictions (between 93.6% and 96.9%) and precision (between 94.59% and 97.87%). However, naive Bayesian classifiers had the highest recall (between 94.71% and 97.78%). This study showed that all experiments have extreme significance with p value < 0.0001. The contribution this research offers helps clinical biologist to submit genetic states to get an initial result on their outcomes. For future work, this implementation could use other machine learning classifiers such as xgboost or deep learning methods. Other suggestions offered are developing methods that improves the performance of state transition that allow for larger training sets to be sampled.
机译:遗传调控网络(GRN)为临床生物学提供了有用的模型。具体而言,此类网络捕获基因,蛋白质和其他代谢因子之间的相互作用。不幸的是,难以理解和预测具有实际规模和复杂性的网络的行为。在本文中,行为是指状态的轨迹,该轨迹通过一​​系列随时间变化的状态转移到网络中的吸引子。该项目假设异步布尔网络,这意味着一个状态可能过渡到多个吸引子。该项目的目标是有效地识别网络的吸引子集,并预测任意状态导致网络的每个吸引子的可能性。这些概率将使用模糊隶属度矢量表示。使用机器学习技术预测模糊隶属度矢量可以解决现实规模和复杂性网络带来的难处理性。建模和仿真可用于为机器学习方法提供必要的训练集,以预测模糊隶属向量。实验包括几个GRN,每个GRN都由一组输出类别表示。这些类别包括阈值tau和¬tau,其中tau = [tau low,tauhigh];如果状态s过渡到吸引子A的概率属于[taulow,tau high]范围,则​​它属于tau类;否则,它属于¬tau类。最后,每个机器学习分类器都使用先前收集的训练集进行了训练。目的是探索在现实的复杂监管网络中发现有意义的状态分类模式的方法。研究设计采用GRN和机器学习方法作为输入和产生的输出类别及其否定¬。对于每个GRN,识别吸引子,通过对每个状态进行采样以创建模糊隶属向量来收集数据,并训练机器学习方法来预测状态是否处于健康吸引子中。对于T-LGL,SVM的预测准确度(介于93.6%和96.9%之间)和精确度(介于94.59%和97.87%之间)最高。但是,朴素的贝叶斯分类器的查全率最高(介于94.71%和97.78%之间)。这项研究表明,所有实验都具有极高的意义,p值<0.0001。这项研究提供的贡献有助于临床生物学家提交遗传状态以获得其结果的初步结果。对于将来的工作,此实现可以使用其他机器学习分类器,例如xgboost或深度学习方法。提供的其他建议是开发改进状态转换性能的方法,该方法允许对更大的训练集进行采样。

著录项

  • 作者

    Thomas, Rodney Hartfield.;

  • 作者单位

    Nova Southeastern University.;

  • 授予单位 Nova Southeastern University.;
  • 学科 Computer science.;Systematic biology.
  • 学位 Ph.D.
  • 年度 2018
  • 页码 114 p.
  • 总页数 114
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号