Word vector generation device, word vector generation method, program, and recording medium

Abstract

PROBLEM TO BE SOLVED: To provide a word vector generation device that stores co-occurrence information between an arbitrary word and all distinct words while suppressing the number of dimensions of the co-occurrence vectors, thereby reducing the computational complexity of processing those vectors, and to provide a word vector generation method, a program, and a recording medium.

SOLUTION: The word vector generation device associates some of N components with each word in a word dictionary, morphologically analyzes an input text, and generates a co-occurrence matrix between the resulting set of distinct words and the set of N components. The element of the co-occurrence matrix corresponding to a word A and a component B stores the value obtained by summing, over a predetermined range of the text, the frequency with which word A co-occurs with component B, where B is a component associated with the words in that range.

COPYRIGHT: (C)2010, JPO&INPIT
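The construction described in the abstract can be illustrated with a minimal Python sketch. Several details here are assumptions, not taken from the patent: the abstract does not say how components are assigned to words, so the hypothetical helper `components_for` picks `k` of the `n` components by hashing; whitespace splitting stands in for morphological analysis; and the "predetermined range" is modeled as a symmetric window of `window` tokens. The names `cooccurrence_matrix`, `n`, `k`, and `window` are illustrative only.

```python
import hashlib
from collections import defaultdict


def components_for(word, n=8, k=2):
    """Pick k distinct component indices out of n for a word.

    Assumption: the abstract only says "some of N components" are
    associated with each word; hashing is one hypothetical way to do it.
    """
    if not 0 < k <= n:
        raise ValueError("k must satisfy 0 < k <= n")
    chosen = []
    salt = 0
    while len(chosen) < k:
        digest = hashlib.md5(f"{word}:{salt}".encode("utf-8")).hexdigest()
        idx = int(digest, 16) % n
        if idx not in chosen:
            chosen.append(idx)
        salt += 1
    return chosen


def cooccurrence_matrix(tokens, window=2, n=8, k=2):
    """Build a (distinct words) x (n components) co-occurrence matrix.

    Row A, column B counts how often word A appears within `window`
    tokens of a word whose associated components include B.
    """
    matrix = defaultdict(lambda: [0] * n)
    for i, word in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j == i:
                continue  # a token does not co-occur with itself
            for b in components_for(tokens[j], n, k):
                matrix[word][b] += 1
    return dict(matrix)


if __name__ == "__main__":
    # Whitespace splitting is a stand-in for morphological analysis.
    tokens = "the cat sat on the mat".split()
    for word, row in cooccurrence_matrix(tokens).items():
        print(word, row)
```

Under these assumptions each word vector has `n` dimensions regardless of vocabulary size, which is the dimensionality reduction the abstract aims for; the cost is that words sharing a component become indistinguishable in that dimension.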
