Based on the information service platform of environmental protection floor enterprise and combined with the technologies of Big Data and vertical search engine,this paper designs and realizes the floor industry vertical search engine based on Hadoop distributed platform.The system can access the Internet floor information through WebMagic,manage index by Solr,stores the information in the HDFS and HBASE and achieve a lightweight floor industry vertical search engine.By comparing with the existing floor industry related search function.%以环保地坪企业信息化服务平台为背景,结合行业大数据和垂直搜索引擎技术应用特点,设计并实现了基于Hadoop分布式平台的环保地坪行业垂直搜索引擎。引擎系统主要通过WebMagic获取互联网中地坪信息,使用Solr管理索引,进而将这些信息存储在HDFS和HBASE框架中,通过技术集成实现一个轻量级的地坪行业垂直搜索引擎。提出的环保地坪行业搜索引擎与目前已有地坪行业相关搜索功能比较,表明该搜索引擎在搜索精度和速度性能上达到有效改善。
展开▼