首页> 外文会议>CIKM 10;ACM conference on information and knowledge management >Using Wikipedia Categories for Compact Representations of Chemical Documents
【24h】

Using Wikipedia Categories for Compact Representations of Chemical Documents

机译:使用Wikipedia类别对化学文档进行紧凑表示

获取原文

摘要

Today, Web pages are usually accessed using text search engines, whereas documents stored in the deep Web are accessed through domain-specific Web portals. These portals rely on external knowledge bases, respectively ontologies, mapping documents to more general concepts allowing for suitable classifications and navigational browsing. Since automatically generated ontologies are still not satisfactory for advanced information retrieval tasks, most portals heavily rely on hand-crafted domain-specific ontologies. This, however, also leads to high creation and maintaining costs. On the other hand, a freely available community maintained, if somewhat general, knowledge base is offered by Wikipedia. During the last years the coverage of Wikipedia has reached a large pool of information including articles from almost all domains. In this paper, we investigate the use of Wikipedia categories to describe the content of chemical documents in a compact form. We compare the results to the domain-specific ChEBI ontology and the results show that Wikipedia categories indeed allow useful descriptions for chemical documents that are even better than descriptions from the ChEBI ontology.
机译:如今,通常使用文本搜索引擎来访问Web页面,而通过特定于域的Web门户访问深层Web中存储的文档。这些门户网站分别依赖于外部知识库和本体,将文档映射到更通用的概念,从而可以进行适当的分类和导航浏览。由于自动生成的本体对于高级信息检索任务仍然不令人满意,因此大多数门户严重依赖手工制作的特定于域的本体。然而,这也导致高的创建和维护成本。另一方面,维基百科提供了一个免费的社区,该社区维护了某种程度上的通用知识库。在过去的几年中,Wikipedia的覆盖范围已经达到了很大的信息量,包括来自几乎所有领域的文章。在本文中,我们调查了使用Wikipedia类别以紧凑形式描述化学文档的内容。我们将结果与特定领域的ChEBI本体进行了比较,结果表明Wikipedia类别的确为化学文献提供了有用的描述,甚至比ChEBI本体的描述更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号