首页> 外国专利> Concept-based method and system for dynamically analyzing unstructured information

Concept-based method and system for dynamically analyzing unstructured information

机译:基于概念的动态分析非结构化信息的方法和系统

摘要

A method, operating model, system, data structure, computer program and computer program product for analyzing and categorizing unstructured information is provided such that conventional structured data access techniques can be utilized over unstructured objects. A analysis and categorization engine builds a set of concept groupings, each grouping consisting of related words and phrases. The concept groupings are augmented by user input. A set of categories is built. The analysis and categorization engine generates a vector representation of each object based on concepts and utilizes a statistical analysis to select concepts that represent each object and assign objects to categories. Information about users, objects, and categories is stored in an open architecture, such as a relational database. An object concept based search is provided to efficiently locate meaningful objects and to provide for updating of the object categorization based on search entries.
机译:提供了一种用于对非结构化信息进行分析和分类的方法,操作模型,系统,数据结构,计算机程序和计算机程序产品,从而可以在非结构化对象上利用常规的结构化数据访问技术。分析和分类引擎将构建一组概念分组,每个分组均由相关的单词和短语组成。用户输入可以增强概念分组。建立了一组类别。分析和分类引擎基于概念生成每个对象的矢量表示,并利用统计分析来选择代表每个对象的概念并将对象分配给类别。有关用户,对象和类别的信息存储在开放式体系结构中,例如关系数据库。提供基于对象概念的搜索以有效地定位有意义的对象并提供基于搜索条目的对象分类的更新。

著录项

  • 公开/公告号US6970881B1

    专利类型

  • 公开/公告日2005-11-29

    原文格式PDF

  • 申请/专利权人 RENGASWAMY MOHAN;USHA MOHAN;

    申请/专利号US20020087053

  • 发明设计人 USHA MOHAN;RENGASWAMY MOHAN;

    申请日2002-03-01

  • 分类号G06F17/30;

  • 国家 US

  • 入库时间 2022-08-21 21:40:46

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号