Conference: IEEE/WIC/ACM Joint International Conference on Web Intelligence and Intelligent Agent Technology

Concept Extraction and Clustering for Topic Digital Library Construction

Abstract

This paper introduces a new approach to building a topic digital library using concept extraction and document clustering. First, documents in a specific domain are obtained automatically by a document classification approach. Then, the keywords of each document are extracted using a machine learning approach. The keywords are used to cluster the document subset, and the clustering result serves as the taxonomy of the subset. Finally, the taxonomy is manually adjusted into a hierarchical structure for user navigation. The topic digital library is constructed by combining full-text retrieval with hierarchical navigation.
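
The keyword extraction and clustering steps described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the learning model or the clustering algorithm, so everything here is an assumption made for illustration: TF-IDF scoring stands in for the paper's machine learning keyword extractor, single-link grouping by Jaccard similarity stands in for its clustering method, and the names `extract_keywords`, `cluster_by_keywords`, `top_k`, and `threshold` are hypothetical.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase the text and keep purely alphabetic whitespace-separated tokens."""
    return [w for w in text.lower().split() if w.isalpha()]


def extract_keywords(docs, top_k=3):
    """Return one keyword set per document.

    ASSUMPTION: TF-IDF is a stand-in for the paper's (unspecified)
    machine learning keyword extractor.
    """
    token_lists = [tokenize(d) for d in docs]
    n = len(docs)
    # Document frequency: in how many documents each word occurs.
    df = Counter(w for tokens in token_lists for w in set(tokens))
    keyword_sets = []
    for tokens in token_lists:
        tf = Counter(tokens)
        scores = {w: tf[w] * math.log(n / df[w]) for w in tf}
        # Sort by descending score, then alphabetically to break ties.
        ranked = sorted(scores, key=lambda w: (-scores[w], w))
        keyword_sets.append(set(ranked[:top_k]))
    return keyword_sets


def jaccard(a, b):
    """Jaccard similarity of two sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cluster_by_keywords(keyword_sets, threshold=0.15):
    """Group document indices whose keyword sets overlap.

    ASSUMPTION: single-link grouping is a stand-in for the paper's
    (unspecified) clustering algorithm. A document joins, and merges,
    every existing cluster that contains a sufficiently similar member.
    """
    clusters = []
    for i, kws in enumerate(keyword_sets):
        matching = [
            c for c in clusters
            if any(jaccard(kws, keyword_sets[j]) >= threshold for j in c)
        ]
        merged = [i]
        for c in matching:
            merged.extend(c)
            clusters.remove(c)
        clusters.append(sorted(merged))
    return sorted(clusters)


# Toy corpus, invented for this example only.
docs = [
    "neural network training neural network",
    "neural network inference network",
    "library catalog metadata catalog",
    "digital library catalog search",
]
keywords = extract_keywords(docs)
clusters = cluster_by_keywords(keywords)
print(keywords)
print(clusters)
```

In this sketch each resulting cluster would correspond to one flat taxonomy node. The manual adjustment into a navigation hierarchy and the full-text retrieval component mentioned in the abstract are not modeled.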