首页> 外国专利> Large scale concept discovery for webpage augmentation using search engine indexers

Large scale concept discovery for webpage augmentation using search engine indexers

机译:使用搜索引擎索引器进行网页增强的大规模概念发现

摘要

Disclosed is a method and system for retrieving data; extracting information from the data; learning to disambiguate the extracted information such that a particular sense of each phrase within the extracted information is determined; generating a disambiguation classifier from the learning to disambiguate step, the disambiguation classifier configured to determine a sense of a phrase within a document; learning to select a portion of the information as being relevant to a theme of the data; generating a selection classifier from the learning to select step, the selection classifier configured to select a topic in a document that is relevant to a theme of the document; and using the disambiguation classifier and the selection classifier by an indexing computer to determine a set of topics from a web document retrieved by the indexing computer.
机译:公开了一种数据检索方法和系统。从数据中提取信息;学习消除所提取信息的歧义,从而确定所提取信息中每个短语的特定含义;从学习到消歧步骤生成消歧分类器,该消歧分类器被配置为确定文档内的短语的含义;学习选择与信息的主题相关的信息的一部分;从学习选择步骤中生成选择分类器,选择分类器被配置为选择文档中与文档主题相关的主题;索引计算机使用消歧分类器和选择分类器从索引计算机检索的Web文档中确定一组主题。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号