首页> 外文会议>Proceedings of the 1st ACM SIGKDD Workshop on Knowledge Discovery from Uncertain Data >Exploiting contexts to deal with uncertainty in classification
【24h】

Exploiting contexts to deal with uncertainty in classification

机译:利用上下文来处理分类中的不确定性

获取原文
获取原文并翻译 | 示例

摘要

Uncertainty is often inherent to data and still there are just a few data mining algorithms that handle it. In this paper we focus on how to account for uncertainty in classification algorithms, in particular when data attributes should not be considered completely truthful for classifying a given sample. Our starting point is that each piece of data comes from a potentially different context and, by estimating context probabilities of an unknown sample, we may derive a weight that quantifies their influence. We propose a lazy classification strategy that incorporates the uncertainty into both the training and usage of classifiers. We also propose uK-NN, an extension of the traditional K-NN that implements our approach. Finally, we illustrate uK-NN, which is currently being evaluated experimentally, using a document classification toy example.
机译:不确定性通常是数据固有的,仍然只有少数数据挖掘算法可以处理。在本文中,我们着重于如何解决分类算法中的不确定性,特别是在不应将数据属性视为对给定样本进行分类时完全真实的情况下。我们的出发点是每条数据都来自潜在的不同上下文,并且通过估计未知样本的上下文概率,我们可以得出权重来量化它们的影响。我们提出了一种惰性分类策略,该策略将不确定性纳入分类器的训练和使用中。我们还提出了uK-NN,它是实现我们方法的传统K-NN的扩展。最后,我们使用文档分类玩具示例说明了uK-NN,目前正在通过实验对其进行评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号