首页> 外文期刊>Peer-to-peer networking and applications >Classifying peer-to-peer applications using imbalanced concept-adapting very fast decision tree on IP data stream - Springer
【24h】

Classifying peer-to-peer applications using imbalanced concept-adapting very fast decision tree on IP data stream - Springer

机译:在IP数据流上使用不平衡的概念自适应非常快速的决策树对点对点应用程序进行分类-Springer

获取原文
获取原文并翻译 | 示例
           

摘要

Peer-to-Peer (P2P) applications generate streaming data in large volumes, where new communities of peers regularly attend and existing communities of peers regularly leave, requiring the classification techniques to consider concept drift, and update the model incrementally. Concept-adapting Very Fast Decision Tree (CVFDT) is one of the well-known streaming data mining techniques that can be applied to P2P traffic. However, we observe that P2P traffic data is class imbalanced, namely, only about 30 % of examples can be labeled as “P2P”, biasing the trained models (e.g. decision tree) towards the majority class (i.e. “NonP2P”). In this paper, we propose a new technique, the imbalanced CVFDT (iCVFDT), by integrating the CVFDT with an efficient resampling technique to address the issue of the class imbalanced data. The iCVFDT classification technique enjoys the advantages of CVFDT (such as stability), and at the same time, is not sensitive to imbalanced data. We captured the Internet traffic at a main gateway and prepared a real data stream with 3.5 million examples to which the iCVFDT classification technique was applied. The experimental results demonstrate a significant improvement in the performance of the iCVFDT compared to that of the CVFDT.
机译:对等(P2P)应用程序会大量生成流数据,新的对等社区定期参加,而现有的对等社区定期离开,这需要分类技术考虑概念漂移并逐步更新模型。适应概念的超快速决策树(CVFDT)是可以应用于P2P流量的众所周知的流数据挖掘技术之一。但是,我们发现P2P流量数据是类别不平衡的,即,只有大约30%的示例可以标记为“ P2P”,从而将训练有素的模型(例如决策树)偏向多数类别(即“ NonP2P”)。在本文中,我们提出了一种新技术,即不平衡CVFDT(iCVFDT),它是通过将CVFDT与有效的重采样技术相集成来解决类不平衡数据的问题。 iCVFDT分类技术具有CVFDT的优点(例如稳定性),并且同时对不平衡数据不敏感。我们在主网关处捕获了Internet流量,并使用iCVFDT分类技术准备了包含350万个示例的真实数据流。实验结果表明,与CVFDT相比,iCVFDT的性能有了显着提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号