
INFORMATION ENRICHMENT USING GLOBAL STRUCTURE LEARNING


Abstract

Methods, systems, and computer program products that implement data enrichment using global structure learning are disclosed. An information enrichment system predicts a likely canonical name from a transaction record in which names may be shortened or extra tokens inserted. In a training phase, the system determines tag patterns from labeled and unlabeled training transaction records. The tag patterns include the co-occurrence probability of tags and the sequential order in which the tags co-occur. In a testing phase, the system receives a test transaction record and predicts a likely tag sequence for it based on the tag patterns. The system then predicts a canonical name based on the likely tag values and token composition, and can enrich the test transaction record with the predicted canonical name.
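The two phases described in the abstract can be illustrated with a minimal sketch. This is not the patented method: it swaps in a plain HMM-style tagger with Viterbi decoding, trains on labeled records only (the abstract also mentions unlabeled ones), and matches shortened names by a simple subsequence test. The tag names (`NAME`, `STORE_ID`, `CITY`, `STATE`), the sample records, and every function name below are hypothetical and chosen purely for illustration.

```python
from collections import Counter, defaultdict
import math

START = "<s>"  # hypothetical marker for the beginning of a record

# Hypothetical labeled training records: lists of (token, tag) pairs.
LABELED = [
    [("STARBUCKS", "NAME"), ("SEATTLE", "CITY"), ("WA", "STATE")],
    [("WALMART", "NAME"), ("#1234", "STORE_ID"), ("AUSTIN", "CITY"), ("TX", "STATE")],
    [("STARBKS", "NAME"), ("#88", "STORE_ID"), ("PORTLAND", "CITY"), ("OR", "STATE")],
]

def train(labeled):
    """Training phase: count tag-to-tag order and token-to-tag usage."""
    trans = defaultdict(Counter)  # trans[prev_tag][tag]: sequential order of tags
    emit = defaultdict(Counter)   # emit[tag][token]: tokens observed under each tag
    for record in labeled:
        prev = START
        for token, tag in record:
            trans[prev][tag] += 1
            emit[tag][token.lower()] += 1
            prev = tag
    return trans, emit

def _logp(counter, key, size):
    # Add-one smoothing keeps unseen tokens and transitions at nonzero probability.
    return math.log((counter[key] + 1) / (sum(counter.values()) + size))

def predict_tags(tokens, trans, emit):
    """Testing phase: Viterbi decoding of the most likely tag sequence."""
    tags = sorted(emit)
    vocab = len({tok for c in emit.values() for tok in c}) + 1
    best = {START: (0.0, [])}  # tag -> (log score, best path ending in that tag)
    for token in tokens:
        nxt = {}
        for tag in tags:
            e = _logp(emit[tag], token.lower(), vocab)
            nxt[tag] = max(
                (score + _logp(trans[prev], tag, len(tags)) + e, path + [tag])
                for prev, (score, path) in best.items()
            )
        best = nxt
    return max(best.values())[1]

def is_abbreviation(short, full):
    """True if `short` can be formed by deleting characters from `full`."""
    it = iter(full)
    return all(ch in it for ch in short)

def canonical_name(tokens, tags, known_names):
    """Pick the known name whose words best cover the tokens tagged NAME."""
    parts = [t.lower() for t, g in zip(tokens, tags) if g == "NAME"]

    def score(name):
        words = name.lower().split()
        return sum(any(is_abbreviation(p, w) for w in words) for p in parts)

    best = max(known_names, key=score, default=None)
    return best if best is not None and score(best) > 0 else None

trans, emit = train(LABELED)
record = ["STARBKS", "AUSTIN", "TX"]
predicted = predict_tags(record, trans, emit)
enriched = canonical_name(record, predicted, ["Starbucks", "Walmart"])
```

Here `predicted` holds one tag per token and `enriched` is the canonical name attached to the record, or `None` when no known name matches. A real system would need a much larger training set and a more careful treatment of inserted tokens than this sketch provides.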
