首页> 外文会议>International Conference on Asian Digital libraries >Using Content-Based and Link-Based Analysis in Building Vertical Search Engines
【24h】

Using Content-Based and Link-Based Analysis in Building Vertical Search Engines

机译:在构建垂直搜索引擎中使用基于内容和基于链路的分析

获取原文

摘要

This paper reports our research in the Web page filtering process in specialized search engine development. We propose a machine-learning-based approach that combines Web content analysis and Web structure analysis. Instead of a bag of words, each Webpage is represented by a set of content-based and link-based features, which can be used as the input for various machine learning algorithms. The proposed approach was implemented using both a feedforward/backpropagation neural network and a support vector machine. An evaluation study was conducted and showed that the proposed approaches performed better than the benchmark approaches.
机译:本文报告了我们在专业搜索引擎开发中的网页过滤过程中的研究。我们提出了一种基于机器学习的方法,结合了Web内容分析和Web结构分析。代替一袋单词,每个网页由一组基于内容和链路的特征表示,可以用作各种机器学习算法的输入。使用前馈/背部化神经网络和支持向量机来实现所提出的方法。进行了评估研究,并表明所提出的方法比基准方法更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号