首页> 外文会议>International Conference on Artificial Intelligence >Using K-means Clustering to Detect Anomalous File Removes
【24h】

Using K-means Clustering to Detect Anomalous File Removes

机译:使用k-means群集来检测异常文件删除

获取原文

摘要

One of the purposes of a data archive is to preserve irreplaceable data for future studies and generations. There are a number of ways that data can be lost from an archive, including accidental or malicious deletion of data. While there is a lot of software that can check for specific known threats or problems on a system, detecting non-specific anomalous behavior, such as unusual file removal patterns, is harder. One approach to detecting this kind of problem is machine learning. Machine learning algorithms can build a statistical model of what constitutes normal behavior and then flag data points that are outliers. To help protect the 87 petabytes of data in the National Center for Atmospheric Research's data archive, we explored our file removal patterns and implemented a k-means clustering solution to detect anomalous file removes. This approach can also be used to detect other anomalies, such as operational inconsistencies.
机译:数据存档的一个目的是为未来的研究和几代保留不可替代的数据。有许多方法可以从存档中丢失数据,包括意外或恶意删除数据。虽然有很多可以在系统上检查特定的已知威胁或问题的软件,但是检测非特定的异常行为,例如不寻常的文件删除模式,更加困难。检测到这种问题的一种方法是机器学习。机器学习算法可以构建构成正常行为的统计模型,然后构建一个正常行为的统计模型,然后标记为异常值的数据点。为了帮助保护国家大气研究的数据存档中的87个PB的数据,我们探讨了我们的文件删除模式并实现了K-Means群集解决方案以检测异常文件删除。这种方法也可用于检测其他异常,例如操作不一致。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号