【24h】

Topic Models for Taxonomies

机译:分类法的主题模型

获取原文

摘要

Concept taxonomies such as MeSH. the ACM Computing Classification System, and the NY Times Subject Headings are frequently used to help organize data. They typically consist of a set of concept names organized in a hierarchy. However, these names and structure are often not sufficient to fully capture the intended meaning of a taxonomy node, and particularly non-experts may have difficulty navigating and placing data into the taxonomy. This paper introduces two semi-supervised topic models that automatically augment a given taxonomy with many additional keywords by leveraging a corpus of multi-labeled documents. Our experiments show that users find the topics beneficial for taxonomy interpretation, substantially increasing their cataloging accuracy. Furthermore, the models provide a better information rate compared to Labeled LDA [7].
机译:概念分类法(例如MeSH)。 ACM计算分类系统和《纽约时报》主题标题经常用于帮助组织数据。它们通常由按层次结构组织的一组概念名称组成。但是,这些名称和结构通常不足以完全掌握分类法节点的预期含义,尤其是非专家可能难以导航并将数据放入分类法中。本文介绍了两个半监督的主题模型,这些模型通过利用多标签文档的语料库,自动使用许多其他关键字来增强给定的分类法。我们的实验表明,用户发现了有助于分类学解释的主题,从而大大提高了他们的编目准确性。此外,与标记的LDA相比,这些模型提供了更好的信息率[7]。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号