The Semantic Web - ASWC 2006; Lecture Notes in Computer Science; 4185

Hierarchical Topic Term Extraction for Semantic Annotation in Chinese Bulletin Board System



Abstract

With growing interest in the Semantic Web, the demand for ontological data has become urgent. Many structured and semi-structured documents have already been used for ontology learning and annotation. However, most electronic documents on the web are plain text, and these texts are still not well utilized for the Semantic Web. In this paper, we propose a novel method that automatically extracts topic terms to generate a concept hierarchy from the data of a Chinese Bulletin Board System (BBS), which is a collection of plain text. In addition, our work provides the source text associated with each extracted concept, which could make it a good fit for semantic search applications that fuse formal and implicit semantics. The experimental results indicate that our method is effective and that the extracted concept hierarchy is meaningful.
