首页> 外国专利> Automated category discovery for a terminological knowledge base

Automated category discovery for a terminological knowledge base

机译:术语知识库的自动类别发现

摘要

A terminological system automatically generates sub-categories from categories of a knowledge base. The knowledge base includes a plurality of hierarchically arranged categories, as well as terms associated with the categories. A subset of the categories of the knowledge base are designated “dimensional categories.” The system also stores a corpus of documents, including themes and corresponding theme weights for each document. A target category is selected to generate sub-categories. A set of themes from the corpus of documents are selected for each term. Dimensional category vectors, one for each term, are generated by associating the set of themes for a term to a dimensional category in the knowledge base. The dimensional category vectors for each term are analyzed to determine if one or more clusters of terminological groups exist to generate new sub-categories. A content processing system, which generates themes and theme weights, is also disclosed.
机译:术语系统会根据知识库的类别自动生成子类别。知识库包括多个按层次结构排列的类别以及与这些类别关联的术语。知识库类别的子集被指定为“维度类别”。该系统还存储文档的语料库,包括每个文档的主题和相应的主题权重。选择目标类别以生成子类别。为每个术语选择一系列文档主题。通过将一个术语的主题集与知识库中的维类别相关联,可以生成每个术语一个的维类别向量。分析每个术语的维类别向量,以确定是否存在一个或多个术语组簇以生成新的子类别。还公开了一种生成主题和主题权重的内容处理系统。

著录项

  • 公开/公告号US6513027B1

    专利类型

  • 公开/公告日2003-01-28

    原文格式PDF

  • 申请/专利权人 ORACLE CORPORATION;

    申请/专利号US19990270319

  • 发明设计人 JAMES CONKLIN;JOSHUA POWERS;

    申请日1999-03-16

  • 分类号G06N50/00;

  • 国家 US

  • 入库时间 2022-08-22 00:04:44

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号