首页> 外国专利> Method for extracting profiles and topics from a first file written in a first markup language and generating files in different markup languages containing the profiles and topics for use in accessing data described by the profiles and topics

Method for extracting profiles and topics from a first file written in a first markup language and generating files in different markup languages containing the profiles and topics for use in accessing data described by the profiles and topics

机译:从以第一标记语言编写的第一文件中提取配置文件和主题并生成包含配置文件和主题的不同标记语言的文件的方法,用于访问配置文件和主题描述的数据

摘要

A computer-implemented method and system for of retrieving information. A first file of information is received which includes a first markup language to identify contents of the information. Responsive to the receiving the first file of information, the first file of information is parsed to generate a list of profiles, and at least one corresponding topic for each of the list of profiles. A second file in a second markup language is created containing the list of the profiles and at least one corresponding third file is created in a third markup language for the at least one corresponding topic for each of the list of profiles. The second file contains anchors referencing each at least one corresponding third file, and first markup instances in the first file of information are converted to second markup instances in either the second file or the third file. The first file of information is parsed to determine the at least one article, if any, for the each at least one corresponding topic for the each of the list of profiles, and a corresponding brief for the at least one article. A fourth file and a fifth file are generated for the at least one article, if any, for the each at least one corresponding topic for the each of the list of profiles. The fourth file includes a brief of each the at least one article in the first file of information and an anchor to the fifth file, the fifth file including text for the at least one article, if any, for the each at least one corresponding topic for the each of the list of profiles.
机译:一种用于检索信息的计算机实现的方法和系统。接收第一信息文件,该第一文件包括用于标识信息内容的第一标记语言。响应于接收到第一信息文件,第一信息文件被解析以生成简档列表,以及针对简档列表中的每一个的至少一个对应主题。创建第二标记语言的第二文件,其包含简档列表,并且针对第三简档列表的每一个,为第三标记语言创建至少一个对应的第三文件,用于第三标记语言。第二文件包含引用每个至少一个对应的第三文件的锚,并且第一信息文件中的第一标记实例被转换为第二文件或第三文件中的第二标记实例。解析第一信息文件以确定每个简档列表的每个至少一个对应主题的至少一个文章(如果有的话),以及至少一个文章的对应摘要。针对至少一个文章(如果有的话),为每个简档列表的每个至少一个对应主题生成第四文件和第五文件。第四文件包括第一信息文件中至少一个文章的每一个的摘要以及第五文件的锚点,第五文件包括用于至少一个相应主题的至少一个文章的文本(如果有的话)。对于每个配置文件列表。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号