首页> 外文期刊>高分子論文集 >Pseudo-label neighborhood rough set: Measures and attribute reductions
【24h】

Pseudo-label neighborhood rough set: Measures and attribute reductions

机译:伪标签邻域粗糙集:度量和属性约简

获取原文
获取原文并翻译 | 示例
           

摘要

The scale of the radius for constructing neighborhood relation has a great effect on the results of neighborhood rough sets and corresponding measures. A very small radius frequently brings us nothing because any two different samples are separated from each other, though these two samples have the same label. If the radius is growing, then there is a serious risk that samples with different labels may fall into the same neighborhood. Obviously, the radius based neighborhood relation does not take the labels of samples into account, which will lead to unsatisfactory discrimination. To fill such gap, a pseudo-label strategy is systematically studied in rough set theory. Firstly, a pseudo-label neighborhood relation is proposed. Such relation can differentiate samples by not only the distance but also the pseudo labels of samples. Therefore, both the neighborhood rough set and some corresponding measures can be re-defined. Secondly, attribute reductions are explored based on the re-defined measures. The heuristic algorithm is also designed to compute reducts. Finally, the experimental results over UCI data sets tell us that our pseudo-label strategy is superior to the traditional neighborhood approach. This is mainly because the former can significantly reduce the uncertainties and improve the classification accuracies. The Wilcoxon signed rank test results also show that neighborhood approach and pseudo-label neighborhood approach are so different from the viewpoints of the measures and attribute reductions in rough set theory. (C) 2018 Elsevier Inc. All rights reserved.
机译:构造邻域关系的半径范围对邻域粗糙集的结果和相应的度量有很大的影响。很小的半径通常不会给我们带来任何好处,因为尽管这两个样本具有相同的标签,但是任何两个不同的样本都相互分离。如果半径在增加,则存在带有不同标签的样本可能落入相同邻域的严重风险。显然,基于半径的邻域关系没有考虑样本的标签,这将导致令人满意的区分。为了填补这一空白,在粗糙集理论中系统地研究了伪标签策略。首先,提出了伪标签邻域关系。这种关系不仅可以通过距离来区分样本,还可以通过样本的伪标记来区分样本。因此,邻域粗糙集和一些相应的度量都可以重新定义。其次,基于重新定义的度量探索属性约简。启发式算法还设计用于计算折减。最后,UCI数据集上的实验结果告诉我们,我们的伪标签策略优于传统的邻域方法。这主要是因为前者可以显着减少不确定性并提高分类准确性。 Wilcoxon符号秩检验结果还表明,从粗糙集理论的度量和属性约简的角度来看,邻域方法和伪标签邻域方法有很大不同。 (C)2018 Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号