首页> 外国专利> Method and apparatus for maximum entropy modeling, and method and apparatus for natural language processing using the same

Method and apparatus for maximum entropy modeling, and method and apparatus for natural language processing using the same

机译:用于最大熵建模的方法和装置,以及使用该方法和装置进行自然语言处理的方法和装置

摘要

A maximum entropy modeling method is provided which is capable of selecting valid feature functions by excluding invalid feature functions, reducing a modeling time and realizing a high accuracy. The maximum entropy modeling method includes: a first step (S1) of setting an initial value for a current model; a second step (S2) of setting a set of feature functions as a candidate set; a third step (S3) of comparing observed probabilities of respective feature functions included in the candidate set with estimated probabilities of the feature functions according to a current model, and determining the feature functions to be excluded from the candidate set; a fourth step (S4) of adding the remaining feature functions included in the candidate set after excluding the feature functions to be excluded to the respective sets of feature functions of the current model, and calculating parameters of a maximum entropy model thereby to create a plurality of new approximate models; and a fifth step (S5) of calculating a likelihood of learning data using the approximate models, and replacing the current model with a model that is determined based on the likelihood of learning data.
机译:提供了一种最大熵建模方法,该方法能够通过排除无效特征函数来选择有效特征函数,从而减少建模时间并实现高精度。最大熵建模方法包括:为当前模型设置初始值的第一步(S1);第二步(S2),设置特征集作为候选集;第三步骤(S3),比较候选集合中包括的各个特征函数的观测概率与根据当前模型的特征函数的估计概率,并确定要从候选集合中排除的特征函数;第四步骤(S4),在将要排除的特征函数排除之后,将候选集合中包括的其余特征函数添加到当前模型的各个特征函数集,并计算最大熵模型的参数,从而创建多个新的近似模型;第五步骤(S5),使用所述近似模型来计算学习数据的可能性,并且将当前模型替换为基于学习数据的可能性而确定的模型。

著录项

  • 公开/公告号US2002188421A1

    专利类型

  • 公开/公告日2002-12-12

    原文格式PDF

  • 申请/专利权人 TANIGAKI KOICHI;ISHIKAWA YASUSHI;

    申请/专利号US20020092557

  • 发明设计人 KOICHI TANIGAKI;YASUSHI ISHIKAWA;

    申请日2002-03-08

  • 分类号G06F15/00;G06F17/18;G06F101/14;

  • 国家 US

  • 入库时间 2022-08-22 00:11:36

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号