首页> 外国专利> NATURAL LANGUAGE PROCESSING SYSTEM AND METHOD FOR WORD REPRESENTATIONS IN NATURAL LANGUAGE PROCESSING

NATURAL LANGUAGE PROCESSING SYSTEM AND METHOD FOR WORD REPRESENTATIONS IN NATURAL LANGUAGE PROCESSING

机译:自然语言处理中的单词表示系统和方法

摘要

The present invention relates to a natural language processing system and a word expression method in natural language processing, in a word expression method in natural language processing performed by a natural language processing system, a) a vocabulary including at least one word and each word providing a vocabulary dictionary dataset including previously learned word embedding information; b) When a vocabulary based on the vocabulary dictionary dataset is provided as input data, subword information about words existing in the input data is extracted using a word expression model, and the subword information is embedded in words calculating a vector; and c) matching the calculated word embedding vector with the pre-learned word embedding information of the corresponding word, replacing the pre-learned word embedding information with the calculated word embedding vector to learn a word expression for the word. wherein the word expression model includes a convolutional neural network-based convolutional neural network that calculates lower-order word feature vectors using the lower-order word information, and the lower-order word feature vectors calculated by the convolution module. It includes a highway network-based highway module that adaptively combines to calculate a word embedding vector of a corresponding word.
机译:本发明涉及一种自然语言处理中的自然语言处理中的自然语言处理中的自然语言处理中的单词表达方法,该方法由自然语言处理系统执行,a)一种词汇表,包括至少一个单词和每个单词提供的词汇词汇字典数据集包括以前学习的单词嵌入信息; b)当基于词汇字典数据集的词汇提供作为输入数据时,使用Word表达式提取输入数据中存在的关于存在于输入数据中的字的子字信息,并且子字信息被嵌入计算矢量;并且c)将计算出的单词嵌入向量与相应单词的预先学习的单词嵌入信息匹配,用计算出的单词嵌入向量替换预先学习的单词嵌入信息以学习单词的单词表达式。其中,字形表达式模型包括基于卷积神经网络的卷积神经网络,其使用较低的字信息计算低阶字特征向量,以及由卷积模块计算的低阶字特征向量。它包括一个基于公路网络的高速公路模块,其自适应地组合来计算嵌入相应单词的单词矢量。

著录项

  • 公开/公告号KR102260646B1

    专利类型

  • 公开/公告日2021-06-07

    原文格式PDF

  • 申请/专利权人

    申请/专利号KR1020190080967

  • 发明设计人 이상근;김예찬;

    申请日2019-07-04

  • 分类号G06F40/40;G06N3/08;

  • 国家 KR

  • 入库时间 2022-08-24 19:14:21

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号