首页> 外文会议>International Workshop on Knowledge Discovery and Data Mining >Aggregation of Information Resources on the Invisible Web
【24h】

Aggregation of Information Resources on the Invisible Web

机译:无形网络上信息资源的聚合

获取原文

摘要

There are huge numbers of valuable information resources resided on Invisible Web. However, it is hard to use for us. In this paper we propose a system called NewsReaper that is capable of making Invisible Web to be visible, especially the huge number of real-time information, which update frequently and are time-sensitive. NewsReaper makes use of information extraction, text classification, full text index, RSS technologies to aggregate Invisible Web information resource. In first, this paper analyzes the reasons why it is invisible and four types of Invisible Web. At the same time it summarizes the characteristics of Invisible Web. On this basis, the system architecture of NewsReaper will to be introduced. In order to verify this system, there is a test for campus recruitment information compared with general-purpose search engines. Finally, the weaknesses of this system and some further works are discussed.
机译:在隐形Web上存在大量有价值的信息资源。但是,很难为我们使用。在本文中,我们提出了一个名为Newsereal的系统,该系统能够使隐形Web可见,尤其是频繁更新的大量实时信息,并且时间敏感。 Newsereaper利用信息提取,文本分类,全文索引,RSS技术,以聚合不可见Web信息资源。首先,本文分析了它是不可见的原因和四种类型的隐形网络。同时总结了隐形Web的特征。在此基础上,将介绍新的新闻系统架构。为了验证该系统,与通用搜索引擎相比,对校园招聘信息进行了测试。最后,讨论了该系统的弱点和一些进一步的作品。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号