...
首页> 外文期刊>Fundamenta Informaticae >Detecting Irrelevant Subtrees to Improve Probabilistic Learning from Tree-structured Data
【24h】

Detecting Irrelevant Subtrees to Improve Probabilistic Learning from Tree-structured Data

机译:检测不相关的子树以提高树状结构数据的概率学习

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

In front of the large increase of the available amount of structured data (such as XML documents), many algorithms have emerged for dealing with tree-structured data. In this article, we present a probabilistic approach which aims at a priori pruning noisy or irrelevant subtrees in a set of trees. The originality of this approach, in comparison with classic data reduction techniques, comes from the fact that only a part of a tree (i.e. a subtree) can be deleted, rather than the whole tree itself. Our method is based on the use of confidence intervals, on a partition of subtrees, computed according to a given probability distribution. We propose an original approach to assess these intervals on tree-structured data and we experimentally show its interest in the presence of noise.
机译:在大量可用的结构化数据(例如XML文档)面前,出现了许多用于处理树状结构数据的算法。在本文中,我们提出了一种概率方法,该方法旨在先验修剪一组树中的嘈杂或不相关的子树。与经典的数据约简技术相比,这种方法的独创性在于,只能删除树的一部分(即子树),而不是整个树本身。我们的方法基于对子树的划分使用的置信区间,并根据给定的概率分布进行计算。我们提出了一种原始方法来评估树结构数据上的这些间隔,并通过实验表明其对存在噪声的兴趣。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号