首页> 外文会议>International Conference on Computer Design and Applications >Hierachical Fuzzy Set-based Deep Web Source Classification
【24h】

Hierachical Fuzzy Set-based Deep Web Source Classification

机译:基于分层模糊集的深网络源分类

获取原文
获取外文期刊封面目录资料

摘要

This paper presents a classification method of data source using fuzzy set and probabilistic model. The words of each domain are classified into characteristic words and general words according to their contribution to the current domain. The fuzzy set is introduced into the simplification process of characteristic words and the common words as the normalized glossary tool, which can be able to find more precise glossary in the homepage text. And a vocabulary probabilistic model is build after the normalized process in various domains, these words are classified by calculating the distance between the data source form vector and each domain vector.
机译:本文介绍了使用模糊集和概率模型的数据源的分类方法。根据其对当前域的贡献,每个域的单词被分类为特征词和一般单词。模糊集被引入特征词的简化过程和常规词汇工具的常规词汇,这可以在主页文本中找到更精确的词汇表。并且在各个域中的归一化过程之后构建词汇概率模型,通过计算数据源形式向量和每个域向量之间的距离来分类这些词。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号