首页> 外文会议>IEEE International Conference on Industrial and Information Systems >Feasibility of using machine learning to access control in Squid proxy server
【24h】

Feasibility of using machine learning to access control in Squid proxy server

机译:在Squid代理服务器中使用机器学习进行访问控制的可行性

获取原文

摘要

Fast Internet connectivity and billions of web sites have made World Wide Web an attractive place for people to use the Internet in their day-to-day life. Educational institutes provide the Internet access to students mainly for educational purposes. However, most of the time, students are allowed to access any content on the web. Therefore, the full bandwidth is consumed due to access to non-educational content such as streaming non-educational videos and downloading large image files, etc. Prevention of Internet usage on non-education content is practically difficult due to various reasons. Usually, this is implemented in the proxy server through maintaining a blacklist of URLs. Most of the time, this is a static list of URLs. With the fast growing content on the World Wide Web maintaining a static blacklist is impractical. In this paper, we propose a methodology to generate dynamic blacklist of URLs using machine learning techniques. We experimentally investigate several machine learning algorithms to predict whether the URL in concern is educational or noneducational. The results of the initial experiments show that linear support vector machines can be used to predict the content with 98.9% accuracy.
机译:快速的Internet连接和数十亿个网站使万维网成为人们在日常生活中使用Internet的诱人场所。教育机构主要出于教育目的向学生提供Internet访问。但是,大多数时候,学生被允许访问网络上的任何内容。因此,由于访问非教育内容(例如流式非教育视频和下载大图像文件等)而消耗了全部带宽。由于各种原因,实际上很难防止非教育内容使用Internet。通常,这是通过维护URL黑名单在代理服务器中实现的。大多数情况下,这是URL的静态列表。随着万维网上内容的快速增长,保持静态黑名单是不切实际的。在本文中,我们提出了一种使用机器学习技术生成URL动态黑名单的方法。我们通过实验研究了几种机器学习算法,以预测所关注的URL是教育性的还是非教育性的。初始实验的结果表明,线性支持向量机可用于以98.9%的准确度预测含量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号