Compound word break estimation device, method, and program for estimating compound word break position

Abstract

PROBLEM TO BE SOLVED: To provide a compound word break estimation device, method, and program that estimate whether a word is a compound word, regardless of whether the word is registered in a dictionary in advance, and that estimate the proper break position when it is a compound word.

SOLUTION: The compound word break estimation device comprises: a learning data storage part that stores, for each of a plurality of known words, information indicating whether the word is a compound word composed of a plurality of morphemes and, if so, the break position between those morphemes; a similarity calculation part that calculates the similarity between the vector of an unknown word, vectorized by a vectorization processing part using the feature value of each character in the word, and the vector of each known word stored in the learning data storage part; and an estimation part that estimates, on the basis of the similarity, whether the unknown word is a compound word and, if it is, the break position between its morphemes.

COPYRIGHT: (C)2010, JPO&INPIT
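The pipeline in the abstract (vectorize by per-character features, compare against stored known words, estimate from similarity) can be illustrated with a minimal sketch. The abstract does not say which character features, which similarity measure, or which decision rule the patent uses, so everything concrete here is an assumption made for illustration: the features are character identity plus script class at each position, the similarity is cosine similarity, and the estimate simply copies the label and break position of the single most similar known word. The names `char_class`, `vectorize`, `cosine`, `estimate`, and the three entries in `LEARNING_DATA` are hypothetical.

```python
import math
import unicodedata
from collections import Counter


def char_class(ch):
    """Coarse script class of one character (an assumed feature, not from the patent)."""
    name = unicodedata.name(ch, "")
    if name.startswith("HIRAGANA"):
        return "hira"
    if name.startswith("KATAKANA"):
        return "kata"
    if name.startswith("CJK"):
        return "kanji"
    if ch.isdigit():
        return "digit"
    if ch.isalpha():
        return "alpha"
    return "other"


def vectorize(word):
    """Sparse vector built from features of each character in the word."""
    vec = Counter()
    for i, ch in enumerate(word):
        vec[f"{i}:{char_class(ch)}"] += 1  # script class at position i
        vec[f"c:{ch}"] += 1                # character identity
    return vec


def cosine(a, b):
    """Cosine similarity between two sparse vectors (Counter yields 0 for missing keys)."""
    dot = sum(v * b[k] for k, v in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Hypothetical learning data: (known word, break position or None if not a compound).
LEARNING_DATA = [
    ("テレビ番組", 3),  # テレビ | 番組
    ("携帯電話", 2),    # 携帯 | 電話
    ("さくら", None),   # not a compound
]


def estimate(word, learning_data=LEARNING_DATA):
    """Return (is_compound, break_position) copied from the most similar known word."""
    target = vectorize(word)
    _, best_break = max(
        learning_data, key=lambda entry: cosine(target, vectorize(entry[0]))
    )
    if best_break is None:
        return False, None
    # Reuse the neighbour's break position only if it fits inside the unknown word.
    if 0 < best_break < len(word):
        return True, best_break
    return True, None


if __name__ == "__main__":
    for w in ("ラジオ放送", "ひかり"):
        print(w, estimate(w))
```

With this toy data, `ラジオ放送` shares its katakana-then-kanji layout with `テレビ番組`, so the sketch marks it as a compound with a break after the third character, while `ひかり` is closest to `さくら` and is treated as a single word. A real system would need far more learning data and would likely aggregate over several similar words rather than trusting one neighbour.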

Bibliographic data

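  • Publication number: JP4979637B2
  • Publication date: 2012-07-18
  • Original format: PDF
  • Applicant/patentee: ヤフー株式会社
  • Application number: JP20080149909
  • Inventors: 増山 毅司; 平村 昇子
  • Filing date: 2008-06-06
  • Classification: G06F17/27
  • Country: JP
  • Date added to database: 2022-08-21 17:40:20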
