Detecting Irrelevant Subtrees to Improve Probabilistic Learning from Tree-structured Data

Amaury Habrard; Marc Bernard; Marc Sebban

首页> 外文期刊>Fundamenta Informaticae >Detecting Irrelevant Subtrees to Improve Probabilistic Learning from Tree-structured Data

【24h】

Detecting Irrelevant Subtrees to Improve Probabilistic Learning from Tree-structured Data

机译：检测不相关的子树以提高树状结构数据的概率学习

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In front of the large increase of the available amount of structured data (such as XML documents), many algorithms have emerged for dealing with tree-structured data. In this article, we present a probabilistic approach which aims at a priori pruning noisy or irrelevant subtrees in a set of trees. The originality of this approach, in comparison with classic data reduction techniques, comes from the fact that only a part of a tree (i.e. a subtree) can be deleted, rather than the whole tree itself. Our method is based on the use of confidence intervals, on a partition of subtrees, computed according to a given probability distribution. We propose an original approach to assess these intervals on tree-structured data and we experimentally show its interest in the presence of noise.

机译：在大量可用的结构化数据（例如XML文档）面前，出现了许多用于处理树状结构数据的算法。在本文中，我们提出了一种概率方法，该方法旨在先验修剪一组树中的嘈杂或不相关的子树。与经典的数据约简技术相比，这种方法的独创性在于，只能删除树的一部分（即子树），而不是整个树本身。我们的方法基于对子树的划分使用的置信区间，并根据给定的概率分布进行计算。我们提出了一种原始方法来评估树结构数据上的这些间隔，并通过实验表明其对存在噪声的兴趣。

著录项

来源
《Fundamenta Informaticae》 |2005年第2期|p.103-130|共28页
作者
Amaury Habrard; Marc Bernard; Marc Sebban;
展开▼
作者单位

EURISE -Universite Jean Monnet de Saint-Etienne 23, rue du Dr Paul Michelon 42023 Saint-Etienne cedex 2, France;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
data reduction; tree-structured data; noisy data; stochastic tree automata;

机译：数据约简;树状结构数据;噪声数据;随机树自动机;

相似文献

外文文献
中文文献
专利

1. Enumeration Algorithms for All Characteristic Paths and Subtrees from Structurally Compressed Tree-Structured Data [J] . Tomoya Horibe, Yuko Itokawa, Tomoyuki Uchida, IAENG Internaitonal journal of computer science . 2018,第1pta118a227期

机译：结构压缩的树状结构数据中所有特征路径和子树的枚举算法
2. Complete Discovery of Weighted Frequent Subtrees in Tree-Structured Datasets [J] . Rahman AliMohammadzadeh, Ashkan Zarnani MasoudRahgozar, Mostafa H. Chehreghani International journal of computer science and network security . 2006,第8A期

机译：完全发现树结构化数据集中的加权频繁子树
3. Nicotine improves probabilistic reward learning in wildtype but not alpha7 nAChR null mutants, yet alpha7 nAChR agonists do not improve probabilistic learning [J] . Milienne-Petiot Morgane, Higa Kerin K., Grim Andrea, European neuropsychopharmacology: the journal of the European College of Neuropsychopharmacology . 2018,第11期

机译：尼古丁在野生型中提高了概率奖励学习，但不是alpha7 nachr null突变体，但alpha7 nachr激动剂不会改善概率学习
4. Detecting Irrelevant Subtrees to Improve Probabilistic Learning from Tree-structured Data [C] . Amaury Habrard, Marc Bernard, Marc Sebban Advances in Mining Graphs, Trees and Sequences . 2005

机译：检测无关的子树以改善树木结构数据的概率学习
5. Graph-based data analysis: Tree-structured covariance estimation, prediction by regularized kernel estimation and aggregate database query processing for probabilistic inference. [D] . Bravo, Hector Corrada. 2008

机译：基于图的数据分析：树状协方差估计，通过正则核估计进行预测以及用于概率推断的聚合数据库查询处理。
6. Detecting representative data and generating synthetic samples to improve learning accuracy with imbalanced data sets [O] . Der-Chiang Li, Susan C. Hu, Liang-Sian Lin, -1

机译：检测代表性数据并生成合成样本以提高不平衡数据集的学习准确性
7. Detecting Irrelevant subtrees to improve probabilistic learning from tree-structured data [O] . Habrard Amaury, Bernard Marc, Sebban Marc 2005

机译：检测不相关的子树以改善从树状结构数据中的概率学习

Detecting Irrelevant Subtrees to Improve Probabilistic Learning from Tree-structured Data

摘要

著录项

相似文献

相关主题

期刊订阅