首页> 外文学位 >Providing Network Profiling and Tracking Utility in Large Distributed Systems.
【24h】

Providing Network Profiling and Tracking Utility in Large Distributed Systems.

机译:在大型分布式系统中提供网络分析和跟踪实用程序。

获取原文
获取原文并翻译 | 示例

摘要

Within the past few years, the Internet has, to a great extent, impacted every aspect of our daily life. Such impact has played a major role in influencing the design, deployment and functionality of enterprise, campus and even home computer networks. As we increasingly depend on computer networks for communication, information access and storage; entertainment and other activities, managing and securing such networks are critical. Due to its scale and complexity, managing and securing today's large campus or enterprise networks is a challenging task. The scale and complexity comes not only from the number of heterogeneous hosts and devices on the network (e.g., various servers, desktop office client machines, laptops, lab machines, wireless access points, routers and so forth), but also from a wide range of diverse applications running on these machines.;In this thesis, we conduct a study for developing methodologies to profile and track activities within networks by addressing two key problems: capturing the dynamic interaction represented by Internet traffic between inside and outside hosts at the block level; and synthesizing static knowledge-base on hosts and networks to map dynamic interaction to interpretable profiles. We develop methodologies utilizing machine learning techniques for capturing, characterizing and profiling activities within the network. Next, we take these techniques one step further by proposing tools and systems that address profiling and tracking as a utility in a large-scale distributed system.;More specifically, we propose a Hierarchical Extraction of Activity Patterns (HEAPs) methodology to characterize and profile activity patterns within the subnet. We express activities in a host-port association matrix and apply Probabilistic Latent Semantic Analysis (pLSA) to co-cluster dominant and significant activities within the subnet. We also propose a Block-wise (host) Port Activity Matrix (BPAM) to describe the traffic within a block. We then apply Singular Value Decomposition (SVD) low-rank approximation techniques to obtain the low-dimensional subspace representation which captures the typical activities within the block and consequently assign a high-level descriptive label summarizing the activities within the block. We also develop methods to track and quantify changes in the activity within the subnet (or block) over time and demonstrate how to utilize these methods to identify major changes and anomalies within the network. We demonstrate the utility of a light-weigh self-contained tool for multi-level analysis of activities within the network. While the tool does not solve a specific security problem, it helps users and operators localize problems within a small network or individual host.;While our methodologies provide the dynamic interaction within the network, it lacks additional information that help validate the profiling results. Towards that end, we develop a methodology to differentiate dynamic from static IP address blocks. More specifically, we propose a scanning-based technique for identifying dynamic IP addresses blocks within the network. We also include other statistic information by building a system that maps dynamic interaction to static information in a datacenter-like environment. Our system addresses key design issues for providing network management and profiling services in a collaborative system with interpretable characterization and profiling utility.;The thesis serves 1) to propose various novel methodologies utilizing machine learning techniques to extract and profile the behavior of hosts and blocks within the network; 2) to pinpoint design principles for building light-weight as well as large-scale systems for profiling and tracking activities in the network; 3) to propose how to incorporate static information readily available within on-line tools to provide interpretation and mapping for network dynamic interaction.
机译:在过去的几年中,互联网在很大程度上影响了我们日常生活的方方面面。这种影响在影响企业,校园甚至家庭计算机网络的设计,部署和功能方面发挥了重要作用。随着我们越来越依赖计算机网络进行通信,信息访问和存储;娱乐和其他活动,管理和保护此类网络至关重要。由于其规模和复杂性,管理和保护当今的大型校园或企业网络是一项艰巨的任务。规模和复杂性不仅来自网络上异构主机和设备的数量(例如,各种服务器,台式机办公室客户端计算机,笔记本电脑,实验室计算机,无线接入点,路由器等),还来自广泛的范围。这些论文中运行的各种应用程序。;本论文中,我们通过解决两个关键问题来开发用于描述和跟踪网络内活动的方法:在块级捕获由内部和外部主机之间的Internet流量表示的动态交互。 ;并在主机和网络上综合静态知识库,以将动态交互映射到可解释的配置文件。我们利用机器学习技术来开发方法,以捕获,表征和分析网络中的活动。接下来,我们通过提出在大型分布式系统中作为实用程序解决概要分析和跟踪的工具和系统,将这些技术进一步向前发展;更具体地说,我们提出了一种活动模式的分层提取(HEAP)方法来表征和描述子网中的活动模式。我们在主机端口关联矩阵中表达活动,并将概率潜在语义分析(pLSA)应用于子网内的主要活动和重要活动的共同集群。我们还提出了按块(主机)端口活动矩阵(BPAM)来描述块内的流量。然后,我们应用奇异值分解(SVD)低秩逼近技术来获取低维子空间表示形式,该表示形式捕获了块内的典型活动,并因此分配了概述该块内活动的高级描述性标签。我们还开发了跟踪和量化子网(或块)中活动随时间变化的方法,并演示了如何利用这些方法识别网络中的主要变化和异常情况。我们演示了一种轻巧的自包含工具的实用程序,可用于对网络内的活动进行多级分析。虽然该工具不能解决特定的安全问题,但它可以帮助用户和操作员在小型网络或单个主机中定位问题。虽然我们的方法提供了网络内的动态交互,但它缺少有助于验证分析结果的其他信息。为此,我们开发了一种将动态IP地址块与静态IP地址块区分开的方法。更具体地说,我们提出了一种基于扫描的技术来识别网络中的动态IP地址块。通过构建将动态交互映射到类似数据中心的环境中的静态信息的系统,我们还包括其他统计信息。我们的系统解决了在具有可解释的特征和性能分析实用程序的协作系统中提供网络管理和性能分析服务的关键设计问题。论文服务于1)提出各种新颖的方法,利用机器学习技术提取和分析主机和模块内部的行为网络; 2)查明构建轻量级设计原则以及用于分析和跟踪网络活动的大型系统; 3)提出如何将在线工具中容易获得的静态信息纳入网络中,以为网络动态交互提供解释和映射。

著录项

  • 作者

    Sharafuddin, Esam Ahmed.;

  • 作者单位

    University of Minnesota.;

  • 授予单位 University of Minnesota.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2011
  • 页码 158 p.
  • 总页数 158
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号