首页> 外文会议>International Conference on Computational Science and Its Applications(ICCSA 2004) pt.1; 20040514-20040517; Assisi; IT >A Document Classification Algorithm Using the Fuzzy Set Theory and Hierarchical Structure of Document
【24h】

A Document Classification Algorithm Using the Fuzzy Set Theory and Hierarchical Structure of Document

机译:基于模糊集理论和文档层次结构的文档分类算法

获取原文
获取原文并翻译 | 示例

摘要

In present, Information retrieval systems which are simply expressed with combination between keywords and phrase search according to the direct keyword matching method to get the information which users need. But Web documents retrieval systems serve too many documents because of term ambiguity. Also it often happens that words with several meanings occur in a document, but in a rather different context from that expected by the querying person. So the user should need extra time and effort to get more close documents. To overcome these problems, in this paper we propose an information retrieval system based on the content, which connects documents according to the degree of semantic link which it express fuzzy value by fuzzy function. Also we propose an algorithm which it produce the hierarchical structure using the degree of concepts and contents among documents. As result, we are able to select and to provide user-interested documents.
机译:目前的信息检索系统,是根据关键词直接匹配方法,通过关键词与词组搜索的组合来简单表示,以获取用户所需的信息。但是由于术语含糊不清,Web文档检索系统为太多文档提供服务。同样经常发生的是,具有多种含义的单词出现在文档中,但是与查询者所期望的上下文完全不同。因此,用户应该花费更多的时间和精力来获取更多的详细文档。为了克服这些问题,本文提出了一种基于内容的信息检索系统,该系统根据语义链接的程度对文档进行连接,并通过模糊函数表达模糊值。我们还提出了一种算法,该算法利用文档之间的概念和内容的程度来生成层次结构。因此,我们能够选择并提供用户感兴趣的文档。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号