首页> 外文期刊>Mathematical Problems in Engineering: Theory, Methods and Applications >Keyword Search over Probabilistic XML Documents Based on Node Classification
【24h】

Keyword Search over Probabilistic XML Documents Based on Node Classification

机译:基于节点分类的概率XML文档关键词搜索

获取原文
       

摘要

This paper describes a keyword search measure on probabilistic XML data based on ELM (extreme learning machine). We use this method to carry out keyword search on probabilistic XML data. A probabilistic XML document differs from a traditional XML document to realize keyword search in the consideration of possible world semantics. A probabilistic XML document can be seen as a set of nodes consisting of ordinary nodes and distributional nodes. ELM has good performance in text classification applications. As the typical semistructured data; the label of XML data possesses the function of definition itself. Label and context of the node can be seen as the text data of this node. ELM offers significant advantages such as fast learning speed, ease of implementation, and effective node classification. Set intersection can compute SLCA quickly in the node sets which is classified by using ELM. In this paper, we adopt ELM to classify nodes and compute probability. We propose two algorithms that are based on ELM and probability threshold to improve the overall performance. The experimental results verify the benefits of our methods according to various evaluation metrics.
机译:本文介绍了一种基于ELM(极限学习机)的概率XML数据的关键词搜索方法。我们使用这种方法对概率XML数据执行关键字搜索。考虑到可能的世界语义,概率XML文档与传统XML文档在实现关键字搜索方面有所不同。概率XML文档可以看作是由普通节点和分布节点组成的一组节点。 ELM在文本分类应用程序中具有良好的性能。作为典型的半结构化数据; XML数据的标签本身具有定义功能。节点的标签和上下文可以视为该节点的文本数据。 ELM具有很多优势,例如学习速度快,易于实现以及有效的节点分类。集合交集可以在使用ELM分类的节点集中快速计算SLCA。在本文中,我们采用ELM对节点进行分类并计算概率。我们提出了两种基于ELM和概率阈值的算法,以提高整体性能。实验结果根据各种评估指标证明了我们方法的好处。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号