METHOD FOR GENERATION OF ROOT-BASED LANGUAGE MODEL AND LANGUAGE PROCESSING APPARATUS FOR SAME

Abstract

The present invention relates to a method for generating a root-based language model and a language processing apparatus for the same. The method and the apparatus extract each sentence included in voice data in unigram form through language processing, analyze the morphemes of each sentence in unigram form, cluster the morphemes by root, and match the clusters in N-gram form, thereby overcoming a data shortage problem as in a class-based language model. The method and the apparatus can also be adapted to use linguistic relationships: for a verb, mainly the objects that can be placed in front of it are extracted, and for a noun with a postposition, the relationship that follows it is extracted, which improves the accuracy of language extraction. In addition, a new noun that is not present in a language model for voice recognition can be added in an N-gram form that incorporates the various expressions of a sentence, which gives higher voice recognition performance for a proper noun than simply adding the proper noun alone. COPYRIGHT KIPO 2016
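The pipeline described in the abstract (unigram extraction, morpheme analysis, clustering by root, N-gram matching over clusters) can be illustrated with a minimal Python sketch. This is not the patented implementation: `root_of`, `SUFFIXES`, `build_root_clusters`, and `root_ngram_counts` are hypothetical names, and the English suffix stripping is only an assumed stand-in for a real morphological analyzer.

```python
from collections import Counter, defaultdict

# Assumption: a toy suffix list standing in for a real morphological analyzer.
SUFFIXES = ("ing", "ed", "s")


def root_of(token: str) -> str:
    """Strip the first matching suffix; a crude stand-in for morpheme analysis."""
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) > len(suffix) + 2:
            return token[: -len(suffix)]
    return token


def build_root_clusters(sentences):
    """Map each root to the set of surface forms (unigrams) that share it."""
    clusters = defaultdict(set)
    for sentence in sentences:
        for token in sentence.lower().split():
            clusters[root_of(token)].add(token)
    return clusters


def root_ngram_counts(sentences, n=2):
    """Count N-grams over root labels instead of surface forms."""
    counts = Counter()
    for sentence in sentences:
        roots = [root_of(token) for token in sentence.lower().split()]
        for i in range(len(roots) - n + 1):
            counts[tuple(roots[i : i + n])] += 1
    return counts


sentences = ["she plays music", "he played music", "they play games"]
clusters = build_root_clusters(sentences)
bigrams = root_ngram_counts(sentences, n=2)
```

In this toy corpus the surface forms "plays", "played", and "play" fall into one cluster, so the bigram over roots ("play", "music") is counted twice even though no surface bigram repeats. That pooling of counts across inflected forms is the sense in which clustering by root eases data sparsity.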

Bibliographic data

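  • Publication number: KR20160060915A
  • Publication date: 2016-05-31
  • Original format: PDF
  • Applicant/patentee: SK TELECOM. CO. LTD.
  • Application number: KR20140163104
  • Inventor: KIM YOUNG JOON (KR)
  • Filing date: 2014-11-21
  • Classification: G10L15/187; G10L15/26; G10L15/28
  • Country: KR
  • Database entry time: 2022-08-21 14:14:20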
