【24h】

TagTheWeb: Using Wikipedia Categories to Automatically Categorize Resources on the Web

机译:TagTheWeb:使用Wikipedia类别自动对Web资源进行分类

获取原文

摘要

Identifying topics associated with a set of documents is a common task for many applications and can be used to improve various tasks involving documents on the Web, such as search, retrieval, recommendation, and clustering. To address this problem, this paper introduces a tool, called TagTheWeb, as a proposition of a generic classification method, that relies on the knowledge expressed by the taxo-nomic structure of Wikipedia, based on the generation of a fingerprint through the semantic relation between nodes of the Wikipedia Category Graph. TagTheWeb can be used as a WEB interface or as an API to classify any text based resource.
机译:标识与一组文档关联的主题是许多应用程序的常见任务,可用于改进涉及Web上文档的各种任务,例如搜索,检索,推荐和聚类。为了解决这个问题,本文介绍了一种称为TagTheWeb的工具,作为通用分类方法的命题,该工具依赖于Wikipedia的分类结构所表达的知识,其基础是通过指纹之间的语义关系生成指纹Wikipedia类别图的节点。 TagTheWeb可以用作WEB界面或API,以对任何基于文本的资源进行分类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号