首页> 外文会议>International Conference on Management of Technology >Research of vertical search engine in news industry
【24h】

Research of vertical search engine in news industry

机译:新闻业垂直搜索引擎的研究

获取原文

摘要

In summing up the existing network of reptiles, and full-text retrieval based on theoretical knowledge, conducted a Web crawler optimization algorithm so that it can adapt to the needs of vertical search engines, and then sub-word component of Pango and Lucene.Net build an efficient full-text search functions. The innovation of the paper is the analysis of the characteristics of news sites to integrate its features into the traditional vertical search engines. News site on the information requirements for the characteristics of the network by studying the relevant full-text search framework to multithreaded data collection and retrieval of the vertical search engine, performance and user experience goals are to achieve abetter. The entire system by small and medium news site commissioning tests designed to meet the test show that the crawlers can adapt to the new network news industry efficient and timely collection requirements, Lucene.Net segmentation. The integration of Pango built the content for news and information Full-text retrieval system can achieve the accuracy of search engine queries for information and efficient response time demands, thereby increasing the amount of information and user experience.
机译:在总结现有爬行动物网络和基于理论知识的全文检索的基础上,进行了Web爬虫优化算法,使其能够适应垂直搜索引擎的需求,然后对Pango和Lucene.Net进行了分词组件的设计。建立高效的全文本搜索功能。本文的创新之处在于分析新闻站点的特征,以将其特征集成到传统的垂直搜索引擎中。新闻网站对信息的要求是针对网络特征的,通过研究相关的全文搜索框架,以多线程数据收集和检索垂直搜索引擎,在性能和用户体验方面均达到了更好的目标。整个系统通过对中小型新闻网站的调试测试而设计,以满足测试表明该爬虫可以适应新的网络新闻行业高效,及时收集的要求,Lucene.Net进行了细分。 Pango的集成构建了新闻和信息的内容全文检索系统可以实现搜索引擎查询信息的准确性和有效的响应时间要求,从而增加了信息量和用户体验。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号