首页> 外国专利> Automated content filter and URL translation for dynamically generated web documents

Automated content filter and URL translation for dynamically generated web documents

机译:自动化的内容过滤器和URL转换,可动态生成Web文档

摘要

Embodiments provide a method, process and apparatus for filtering a request from a client and building the response to that request using mapping tables. These mapping tables are utilized to present content-related information about hypertext documents that can be dynamically generated from a database, on one or more servers. The dynamically generated hypertext documents may be web pages for the World Wide Web portion of the Internet. The mapping table is used to automatically generate a mapping page to best match its intended viewer's request. A mapping page designed to be viewed by a computer system will be presented in a format optimized for use by a web crawler program to build an index of web pages that may be generated at the server site. A mapping page designed to be viewed by a person will be presented in a human readable format, with optimizations made based on how that user arrived at the page. A site operator will enter the basic information required to generate the first mapping table entries, including information required to build a data access algorithm. Data used in these mapping tables, including the URL (uniform resource locator), keyword data and content, is fetched by an automated web browser (spider) through the HTTP (hyper text transport protocol) transport using the data access algorithm generated. Site operators may specify initial logical data groupings. Mapping table entries may be continuously updated, and subsequent entries may be automatically generated based on the criteria that was used in the requesting query. Individual table entries may be influenced by a predetermined algorithm as designated by the industry that the site operator has selected. ;An additional embodiment provides a method, process and apparatus allowing a human to train a program that creates the mapping table, showing the apparatus various methods for finding dynamically generated data by example. The apparatus then uses the examples to generate the mapping table and the resulting mapping algorithms.
机译:实施例提供了一种用于过滤来自客户端的请求并使用映射表建立对该请求的响应的方法,过程和装置。这些映射表用于呈现有关可以从一台或多台服务器上的数据库动态生成的超文本文档的内容相关信息。动态生成的超文本文档可以是Internet万维网部分的网页。映射表用于自动生成映射页面,以最匹配其预期查看者的请求。设计为由计算机系统查看的映射页将以一种优化的格式呈现,以供Web爬网程序用来构建可能在服务器站点上生成的网页索引。设计成供人查看的地图页面将以人类可读的格式显示,并基于该用户到达页面的方式进行优化。站点操作员将输入生成第一张映射表条目所需的基本信息,包括构建数据访问算法所需的信息。这些映射表中使用的数据(包括URL(统一资源定位符),关键字数据和内容)由自动Web浏览器(蜘蛛)通过HTTP(超文本传输​​协议)传输使用生成的数据访问算法来获取。站点操作员可以指定初始逻辑数据分组。映射表条目可以连续更新,并且可以基于在请求查询中使用的标准自动生成后续条目。各个表条目可能会受到站点运营商选择的行业指定的预定算法的影响。另一个实施例提供了一种方法,过程和设备,该方法,过程和设备允许人们训练创建映射表的程序,该设备通过示例示出了用于查找动态生成的数据的各种方法。然后,该设备使用示例来生成映射表和所得的映射算法。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号