首页> 外国专利> METHOD FOR ATTRIBUTION OF PARTIALLY STRUCTURED TEXTS FOR FORMATION OF NORMATIVE-REFERENCE INFORMATION

METHOD FOR ATTRIBUTION OF PARTIALLY STRUCTURED TEXTS FOR FORMATION OF NORMATIVE-REFERENCE INFORMATION

机译:用于形成规范参考信息的部分结构化文本的方法

摘要

FIELD: computer technology. ;SUBSTANCE: invention relates to computer technology. The method for attributing partially structured texts for generating normative-reference information includes selecting a training set of texts in the natural language of partially structured texts, extracting the appropriate set of features for each category of named entities, training a classification model using the training set of texts and sets of features for each category of named entities, performing training using attributes, obtaining a model for each named entity and checking attributes, extracting tokens from unmarked text by the processor, generating a marked-up representation by the processor of at least a part of the text based on at least one of the tokens classified by categories.;EFFECT: increased speed of data attribution processes. ;1 cl, 2 dwg, 1 tbl
机译:领域:计算机技术。物质:本发明涉及计算机技术。用于生成规范参考信息的部分结构化文本的方法包括选择部分结构化文本的自然语言中的训练组文本集合,从而提取每个类别的命名实体的适当特征集,使用训练集培训分类模型每个类型名称实体的文本和特征集,使用属性执行培训,获取每个命名实体和检查属性的模型,处理器从未标记的文本中提取令牌,至少由处理器生成标记表示基于由类别分类的至少一个令牌的文本的一部分。;效果:数据归因过程的速度增加。 ; 1 cl,2 dwg,1 tbl

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号