Using K-means Clustering to Detect Anomalous File Removes

机译：使用k-means群集来检测异常文件删除

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

One of the purposes of a data archive is to preserve irreplaceable data for future studies and generations. There are a number of ways that data can be lost from an archive, including accidental or malicious deletion of data. While there is a lot of software that can check for specific known threats or problems on a system, detecting non-specific anomalous behavior, such as unusual file removal patterns, is harder. One approach to detecting this kind of problem is machine learning. Machine learning algorithms can build a statistical model of what constitutes normal behavior and then flag data points that are outliers. To help protect the 87 petabytes of data in the National Center for Atmospheric Research's data archive, we explored our file removal patterns and implemented a k-means clustering solution to detect anomalous file removes. This approach can also be used to detect other anomalies, such as operational inconsistencies.

机译：数据存档的一个目的是为未来的研究和几代保留不可替代的数据。有许多方法可以从存档中丢失数据，包括意外或恶意删除数据。虽然有很多可以在系统上检查特定的已知威胁或问题的软件，但是检测非特定的异常行为，例如不寻常的文件删除模式，更加困难。检测到这种问题的一种方法是机器学习。机器学习算法可以构建构成正常行为的统计模型，然后构建一个正常行为的统计模型，然后标记为异常值的数据点。为了帮助保护国家大气研究的数据存档中的87个PB的数据，我们探讨了我们的文件删除模式并实现了K-Means群集解决方案以检测异常文件删除。这种方法也可用于检测其他异常，例如操作不一致。

著录项

来源
《International Conference on Artificial Intelligence》|2018年|484p|共5页
会议地点
作者
B. Anderson; M. Genty;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词
Archive; Data science; Machine learning; Metrics; Analysis; Cybersecurity;

机译：档案;数据科学;机器学习;指标;分析;网络安全;
入库时间 2022-08-21 05:49:04

相似文献

外文文献
中文文献
专利

1. Using a Method Based on a Modified K-Means Clustering and Mean Shift Segmentation to Reduce File Sizes and Detect Brain Tumors from Magnetic Resonance (MRI) Images [J] . Kim JiHoon, Lee Sanghun, Lee GangSeong, Wireless personal communications: An Internaional Journal . 2016,第3期

机译：使用基于改进的K均值聚类和均值漂移分割的方法来减小文件大小并从磁共振（MRI）图像中检测脑肿瘤
2. Minkowski metric, feature weighting and anomalous cluster initializing in K-Means clustering [J] . Cordeiro De Amorim R., Mirkin B. Pattern Recognition: The Journal of the Pattern Recognition Society . 2012,第3期

机译：K均值聚类中的Minkowski度量，特征加权和异常聚类初始化
3. PERFORMANCE OF K-MEANS CLUSTERING AND BIRD FLOCKING ALGORITHM FOR GROUPING THE WEB LOG FILES [J] . R. SUGUNA, D. SHARMILA International Journal of Engineering Science and Technology . 2012,第10期

机译：Web日志文件分组的K均值聚类和Bird植群算法的性能
4. Using K-means Clustering to Detect Anomalous File Removes [C] . B. Anderson, M. Genty International Conference on Artificial Intelligence . 2018

机译：使用k-means群集来检测异常文件删除
5. Single-File and Anomalous Diffusion in Porous Carbons. [D] . Moore, Joshua Daniel. 2010

机译：多孔碳中的单次扩散和异常扩散。
6. Invisible Facial Flushing in Two Cases of Dengue Infection and Influenza Detected by PC Program and Smartphone App: Decorrelation Stretching and K-Means Clustering [O] . Manote Arpornsuwan, Matinun Arpornsuwan 2020

机译：通过PC程序和智能手机应用程序检测到的两例登革热感染和流感隐性面部潮红：去相关拉伸和K均值聚类
7. Detecting abuses in archaeological areas using k-mean clustering analysis and UAVs/drones data [O] . Abdalrahman Qubaa, Saja Al-Hamdani 2021

机译：使用k平均聚类分析和无人机/无人机数据检测考古区中的滥用

Using K-means Clustering to Detect Anomalous File Removes

摘要

著录项

相似文献

相关主题

期刊订阅