
METHOD FOR CHINESE CONCEPT EMBEDDING GENERATION BASED ON WIKIPEDIA LINK STRUCTURE


Abstract

The present invention discloses a method and a device for generating Chinese concept embeddings based on the Wikipedia link structure. The method comprises four steps. Step (1): a link information database is constructed from the title concepts and/or link concepts in Chinese Wikipedia pages. Step (2): for each title concept, positive and negative training instances are constructed according to its link relationships with the link concepts in the link information database; together these instances constitute the training dataset. Step (3): a concept embedding model is built, comprising an input layer, an embedding layer, a computational operation layer, and an output layer. Step (4): the concept embedding model is trained on the training dataset, and the Chinese concept embeddings are then extracted or generated from the trained model. The invention can accurately distinguish different concepts and overcome the polysemy problem that troubles traditional embedding methods, which helps generate more accurate concept embedding representations.
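
The four steps in the abstract can be illustrated with a small Python sketch. It is not the patented implementation: the abstract does not specify the database schema, the negative-sampling rule, or what the computational operation layer computes, so the toy `PAGES` data, the function names, the random choice of non-linked concepts as negatives, and the dot product followed by a sigmoid are all assumptions made for illustration.

```python
import math
import random

# Hypothetical toy data (not from the patent): title concept -> concepts linked
# from its page. "苹果 (水果)" and "苹果公司" share the surface word "苹果" but are
# distinct concepts, which is the polysemy case the abstract mentions.
PAGES = {
    "苹果 (水果)": ["水果", "蔷薇科"],
    "苹果公司": ["iPhone", "史蒂夫·乔布斯"],
    "水果": ["苹果 (水果)"],
    "iPhone": ["苹果公司"],
}

def build_link_db(pages):
    """Step 1 (sketch): map each title concept to the set of its link concepts."""
    return {title: set(links) for title, links in pages.items()}

def build_instances(link_db, negatives_per_positive=1, seed=0):
    """Step 2 (sketch): (title, concept, label) triples, 1 = linked, 0 = not linked.

    Assumption: negatives are sampled uniformly from concepts the title
    does not link to.
    """
    rng = random.Random(seed)
    vocab = sorted(set(link_db) | {c for links in link_db.values() for c in links})
    instances = []
    for title in sorted(link_db):
        links = link_db[title]
        candidates = [c for c in vocab if c not in links and c != title]
        for concept in sorted(links):
            instances.append((title, concept, 1))
            for _ in range(min(negatives_per_positive, len(candidates))):
                instances.append((title, rng.choice(candidates), 0))
    return vocab, instances

def score(emb, a, b):
    """Assumed computational operation + output: sigmoid of the dot product."""
    s = sum(x * y for x, y in zip(emb[a], emb[b]))
    s = max(-30.0, min(30.0, s))  # clamp to keep exp() well-behaved
    return 1.0 / (1.0 + math.exp(-s))

def train(vocab, instances, dim=8, epochs=200, lr=0.1, seed=0):
    """Steps 3-4 (sketch): embedding lookup trained with log-loss SGD."""
    rng = random.Random(seed)
    emb = {c: [rng.uniform(-0.5, 0.5) for _ in range(dim)] for c in vocab}
    for _ in range(epochs):
        for a, b, label in instances:
            grad = score(emb, a, b) - label  # d(log-loss)/d(dot product)
            va, vb = emb[a], emb[b]
            for i in range(dim):
                va_i, vb_i = va[i], vb[i]
                va[i] -= lr * grad * vb_i
                vb[i] -= lr * grad * va_i
    return emb  # the learned vectors are the concept embeddings

link_db = build_link_db(PAGES)
vocab, instances = build_instances(link_db)
embeddings = train(vocab, instances)
print(len(embeddings), "concept vectors of dimension", len(embeddings[vocab[0]]))
```

Keying the embedding table by concept title rather than by word is what lets the two "苹果" concepts receive separate vectors in this sketch; a real pipeline would read link pairs from a Wikipedia dump instead of a hand-written dictionary.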

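Bibliographic data

  • Publication number: LU101242A1

  • Publication date: 2019-07-01

  • Original format: PDF

  • Applicant/patentee: QILU UNIVERSITY OF TECHNOLOGY

  • Application number: LU20180101242

  • Inventor: LU WENPENG

  • Filing date: 2018-10-26

  • Classification: G06F17/27

  • Country: LU

  • Database entry time: 2022-08-21 12:02:00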
