WORD LATENT TOPIC ESTIMATION DEVICE AND WORD LATENT TOPIC ESTIMATION METHOD

Abstract

Provided are a word latent topic estimation device and a word latent topic estimation method that process topics hierarchically and can rapidly estimate the latent topics of a word while taking into account the mixed state of topics. The device comprises: a document data addition unit (11), which receives as input a document that includes one or more words; a level setting unit (12), which sets the number of topics at each level according to the hierarchical structure of topics used to estimate the latent topics of a word hierarchically; a higher-level constraint creation unit (15), which, based on the results of topic estimation at a given level for a word within the document, creates a higher-level constraint indicating the identifier of each topic that may be assigned to the word, together with the probability of the word being assigned to that topic; and a higher-level-constraint-attached topic estimation unit (13), which, when estimating the probability of each word being assigned to each topic, refers to the higher-level constraint, uses the probability of assignment to the parent topic at the higher level as a weight, and performs estimation processing for the lower-level topics.
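
The following is a minimal Python sketch of the two steps the abstract describes for a single word: creating a higher-level constraint from the results at one level, and then using the parent-topic probabilities as weights when estimating the lower-level topics. It is an illustration under stated assumptions, not the patented implementation. The function names, the pruning threshold, the global normalization, and the per-child scores (which stand in for whatever the underlying topic model would produce) are all hypothetical and do not come from the abstract.

```python
from typing import Dict, List


def make_higher_level_constraint(
    parent_probs: Dict[str, float], threshold: float = 0.1
) -> Dict[str, float]:
    """Keep the identifier and probability of each parent topic that may be
    assigned to the word. The threshold rule is an assumption of this sketch."""
    kept = {t: p for t, p in parent_probs.items() if p >= threshold}
    if not kept:
        # Assumed fallback: keep only the most probable parent topic.
        best = max(parent_probs, key=parent_probs.get)
        kept = {best: parent_probs[best]}
    total = sum(kept.values())
    return {t: p / total for t, p in kept.items()}


def estimate_lower_level(
    constraint: Dict[str, float],
    children: Dict[str, List[str]],
    child_scores: Dict[str, float],
) -> Dict[str, float]:
    """Estimate lower-level topic probabilities for one word.

    Only children of parents listed in the constraint are considered, and
    each child's score is weighted by its parent's probability.
    """
    weighted: Dict[str, float] = {}
    for parent, weight in constraint.items():
        for child in children[parent]:
            weighted[child] = weight * child_scores[child]
    total = sum(weighted.values())
    if total == 0:
        raise ValueError("all candidate lower-level topics have zero score")
    return {c: v / total for c, v in weighted.items()}


if __name__ == "__main__":
    # Hypothetical two-level hierarchy with three top-level topics.
    parent_probs = {"A": 0.75, "B": 0.25, "C": 0.0}
    children = {"A": ["A1", "A2"], "B": ["B1"], "C": ["C1"]}
    child_scores = {"A1": 0.5, "A2": 0.5, "B1": 1.0, "C1": 1.0}

    constraint = make_higher_level_constraint(parent_probs)
    print(estimate_lower_level(constraint, children, child_scores))
```

In this sketch, the children of a pruned parent topic (here `C1`) are never evaluated, which is one plausible reading of how the hierarchical constraint speeds up estimation while still allowing a word to be shared among several topics.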
