首页> 外国专利> Generating a Graph Data Structure that Identifies Relationships among Topics Expressed in Web Documents

Generating a Graph Data Structure that Identifies Relationships among Topics Expressed in Web Documents

机译:生成图形数据结构,该结构标识Web文档中主题之间的关系

摘要

A technique produces a graph data structure based on at least partially unstructured information dispersed over web documents. The technique involves applying a machine-trained model to a set of documents (or, more generally “document units”) to identify topics in the documents. The technique then generates count information by counting the occurrences of the single topics and co-occurrences of parings of topics in the documents. The technique generates conditional probability information based on the count information. An instance of conditional probability information describes a probability that a first topic will occur, given an appearance of a second topic, and a probability that the second topic will occur, given an appearance of the first topic. The technique then formulates the conditional probability information in a graph data structure. The technique also provides an application system that utilizes the graph data structure to provide any kind of computer-implemented service to a user.
机译:一种技术基于分散在Web文档上的至少部分非结构化信息产生图数据结构。该技术涉及将机器训练模型应用于一组文档(或更通常“文档单元”)以识别文档中的主题。然后,该技术通过计算文档中的单个主题和共同分析的单个主题和共同发生的发生来生成计数信息。该技术基于计数信息生成条件概率信息。条件概率信息的实例描述了给定第二主题的外观,以及给定第一个主题的外观时发生第二主题的概率的概率。然后,该技术在图形数据结构中制定条件概率信息。该技术还提供了一种应用系统,该应用系统利用图形数据结构来向用户提供任何类型的计算机实现的服务。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号