首页> 外文期刊>Neural computing & applications >Using Voronoi diagrams to improve classification performances when modeling imbalanced datasets
【24h】

Using Voronoi diagrams to improve classification performances when modeling imbalanced datasets

机译:在对不平衡数据集进行建模时,使用Voronoi图改善分类性能

获取原文
获取原文并翻译 | 示例
           

摘要

An over-sampling technique called V-synth is proposed and compared to borderline SMOTE (bSMOTE), a common methodology used to balance an imbalanced dataset for classification purposes. V-synth is a machine learning methodology that allows synthetic minority points to be generated based on the properties of a Voronoi diagram. A Voronoi diagram is a collection of geometric regions that encapsulate classifying points in such a way that any point within the region is closest to the encapsulated classifier than any other adjacent classifiers based on their distance from one another. Because of properties inherent to Voronoi diagrams, V-synth identifies exclusive regions of feature space where it is ideal to create synthetic minority samples. To test the generalization and application of V-synth, six databases from various problem domains were selected from the University of California Irvine's Machine Learning Repository. Though not always guaranteed due to the random nature of synthetic over-sampling, significant evidence is presented that supports the hypothesis that V-synth more consistently leads to the creation of more accurate and better-balanced classification models than bSMOTE when the classification complexity of a dataset is high.
机译:提出了一种称为V-synth的过采样技术,并将其与边界SMOTE(bSMOTE)(边界线SMOTE)进行比较,边界线SMOTE是一种用于平衡不平衡数据集以进行分类的通用方法。 V-synth是一种机器学习方法,可以根据Voronoi图的属性生成综合少数点。 Voronoi图是几何区域的集合,这些几何区域以这样一种方式封装分类点,即区域内的任何点都基于彼此之间的距离比任何其他相邻分类器最接近封装的分类器。由于Voronoi图具有固有的属性,因此V-synth可以识别特征空间的排他区域,是创建合成少数样本的理想选择。为了测试V-synth的泛化和应用,从加利福尼亚大学尔湾分校的机器学习存储库中选择了来自各个问题领域的六个数据库。尽管由于合成过采样的随机性而不能始终保证,但有大量证据支持以下假设:当a的分类复杂度比bSMOTE稳定时,V-synth可以更准确地创建平衡模型。数据集很高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号