首页> 中文期刊>计算机应用研究 >一种基于改进的权值调整技术数据源分类算法研究

一种基于改进的权值调整技术数据源分类算法研究

     

摘要

针对传统的搜索引擎无法正确搜索到Deep Web中隐藏的海量信息,对Web数据库的分类是通向Web数据库分类集成和检索的关键步骤.提出了一种基于权值调整技术的Deep Web数据库分类方法,首先从网页表单中提取特征;然后对这些特征使用一种新的权重计算方法进行估值;最后利用朴素贝叶斯分类器对Web数据库进行分类.实验表明,这种分类方法经过少量样本训练后,就能达到很好的分类效果,并且随着训练样本的增加,该分类器的性能保持稳定,准确率、召回率都在很小的范围内波动.%The traditional search engine is unable to correct search for the magnanimous information in Deep Web hides. The Web database' s classification is the key step which integrates with the Web database classification and retrieves. This paper proposed a kind of classification of Deep Web data sources based on weight adjustment technique, which, used a new weight adjustment method to valuate the weight of feature extracted from the homepage form, and finally used the simple Bayes sorter to classify the Web database. The experiment indicates that after this taxonomic approach undergoes few sample training, it can achieve the very good classified effect, and along with training sample' s increase, this classifier' s performance maintains stable and the rate of accuracy and the recalling rate fluctuate in the very small scope.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号