首页> 外文学位 >Analysis of network type data using statistical methods.
【24h】

Analysis of network type data using statistical methods.

机译:使用统计方法分析网络类型数据。

获取原文
获取原文并翻译 | 示例

摘要

In many scientific research fields, networks have been widely used to represent or analyze a system. Network type data are high dimensional measurements that either are obtained from a network system or can be represented in terms of networks. In recent years, there has been a rapid growth of high throughput data that can be viewed as network type data, and statistical models have become the major tool for analyzing network type data. In this thesis, I explore network type data from three aspects using different statistical techniques.;First, I study the accurate detection of sources of perturbations in a complex network. In networks, the interactions between nodes make it difficult to localize the sources of the external perturbations. To detect the perturbation source, a 'network filtering' system can be used to filter out this interaction effect. However the theoretical rationale behind this system is still a mystery. To this end, I present a theoretical characterization of why and when 'network filtering' can detect external perturbations accurately. Then I study the implications of the conditions in the context of various network topologies through simulation studies.;Second, I explore the use of network topology to design multi-scale clusters on graphs. Specifically, I adopt the framework of 'diffusion wavelets', and specially modify it for graphs with heavy degree distribution. Based on scaling functions from the diffusion wavelets, I define a collection of vertex subsets across multiple scales on graphs, and apply it on a yeast protein interaction network to obtain multi-scale gene sets. The effectiveness of our gene sets is justified by comparisons with standard gene sets and through an application to differential expression analysis.;Thirdly, I explore the optimal design of perturbations in order to get observations that can help us study networks more effectively. Under the same model used in my first project, I propose an optimal design framework that can model the whole network. I begin with the perturbation design for individual units, and then study the general cases with several approximation methods. Simulation studies have been done to validate my design methods.
机译:在许多科学研究领域中,网络已被广泛用于表示或分析系统。网络类型数据是可以从网络系统获得或可以用网络表示的高维度度量。近年来,可以视为网络类型数据的高吞吐量数据迅速增长,统计模型已成为分析网络类型数据的主要工具。本文利用不同的统计技术从三个方面探讨网络类型数据。首先,研究复杂网络中扰动源的精确检测。在网络中,节点之间的交互使得很难定位外部扰动的来源。为了检测扰动源,可以使用“网络过滤”系统来滤除这种交互作用。但是,该系统背后的理论原理仍然是个谜。为此,我提出了“网络过滤”为何以及何时可以准确检测外部干扰的理论特征。然后,通过仿真研究,研究了各种网络拓扑条件下条件的影响。其次,我探索了使用网络拓扑设计图上的多尺度聚类的方法。具体来说,我采用“扩散小波”的框架,并针对具有严重程度分布的图专门对其进行了修改。基于扩散小波的缩放函数,我定义了图形上多个尺度上的顶点子集的集合,并将其应用于酵母蛋白质相互作用网络上以获得多尺度的基因集。通过与标准基因组进行比较并应用于差异表达分析,证明了我们基因组的有效性。第三,我探索了扰动的最佳设计,以便获得可以帮助我们更有效地研究网络的观察结果。在第一个项目中使用的相同模型下,我提出了一个可以对整个网络建模的最佳设计框架。我从单个单元的扰动设计开始,然后用几种近似方法研究一般情况。仿真研究已经完成,以验证我的设计方法。

著录项

  • 作者

    Yang, Shu.;

  • 作者单位

    Boston University.;

  • 授予单位 Boston University.;
  • 学科 Biology Biostatistics.;Statistics.
  • 学位 Ph.D.
  • 年度 2011
  • 页码 99 p.
  • 总页数 99
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号