首页> 外国专利> KOREAN NAMED ENTITIES RECOGNITION METHOD BASED ON MAXIMUM ENTROPY MODEL AND NEURAL NETWORK MODEL

KOREAN NAMED ENTITIES RECOGNITION METHOD BASED ON MAXIMUM ENTROPY MODEL AND NEURAL NETWORK MODEL

机译:基于最大熵模型和神经网络模型的韩国命名实体识别方法

摘要

The invention belongs to the technical field of named entities recognition, and discloses a Korean named entities recognition method on the basis of a maximum entropy model and a neural network model. The method comprises: building a prefix tree dictionary, when any one combined noun template is matched with any one proper noun template in an input sentence, it is recognized as a target word; obtaining the target word in a target word selection module, searching the target word in an entity dictionary, and when only one subclass is matched, the subclass serves as a tag of the target word; adopting the maximum entropy model, and utilizing various linguistics information; constructing a feedforward neural network model; and grouping adjacency words into an entity tag through a template selection rule. All data used in the method is extracted in a training corpus with tags and a field-independent entity dictionary, the data is very easily migrated into other application fields, and the performance cannot be reduced obviously.
机译:本发明属于命名实体识别技术领域,公开了一种基于最大熵模型和神经网络模型的韩国命名实体识别方法。该方法包括:建立前缀树词典,当输入句子中任意一个组合名词模板与任意一个专有名词模板匹配时,将其识别为目标词;在目标词选择模块中获取目标词,在实体字典中搜索目标词,当仅匹配一个子类时,该子类作为目标词的标签;采用最大熵模型,并利用各种语言学信息;建立前馈神经网络模型;通过模板选择规则将邻接词分组为实体标签。该方法中使用的所有数据都在带有标签和字段无关实体字典的训练语料库中提取,这些数据很容易迁移到其他应用程序领域,并且性能不会明显降低。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号