首页> 外国专利> APPARATUS FOR CREATING ALIGNMENT CORPUS BASED ON UNSUPERVISED ALIGNMENT AND METHOD THEREOF, AND APPARATUS FOR PERFORMING MORPHOLOGICAL ANALYSIS OF NON-CANONICAL TEXT USING THE ALIGNMENT CORPUS AND METHOD THEREOF

APPARATUS FOR CREATING ALIGNMENT CORPUS BASED ON UNSUPERVISED ALIGNMENT AND METHOD THEREOF, AND APPARATUS FOR PERFORMING MORPHOLOGICAL ANALYSIS OF NON-CANONICAL TEXT USING THE ALIGNMENT CORPUS AND METHOD THEREOF

机译:基于非监督的对齐方式创建对齐语料的装置及其方法,以及使用对齐语料对非规范文本进行形态分析的装置及其方法

摘要

Disclosed are an alignment corpus generation device based on unsupervised learning alignment, a method thereof, a morphological analysis device for analyzing non-canonical expressions using alignment corpus, and a morphological analysis method thereof. The morphological analysis device includes a knowledge database and an analyzer. The knowledge database stores a plurality of knowledge items used for morphological analysis in each language and includes: a morpheme dictionary in which morpheme data corresponding to canonical expressions and an aligned corpus in which canonical expressions corresponding to non-canonical expressions - wrong-spelled expressions and unnormalized or unstandardized expressions- are stored. The analyzer performs morphological analysis of an inputted phrase using the knowledge database and outputs the analysis result. If no morpheme exists for the inputted phrase in the morpheme dictionary, the present invention preforms morphological analysis by finding a canonical expression corresponding to the non-canonical expression using the alignment corpus for the non-canonical expression included in the inputted phrase.
机译:公开了基于无监督学习对齐的对齐语料库生成设备,其方法,使用对齐语料库分析非规范表达的形态分析设备及其形态分析方法。形态分析设备包括知识数据库和分析器。知识数据库存储用于每种语言的词法分析的多个知识项,并且包括:词素词典,其中对应于规范表达的词素数据;以及对齐语料库,其中对齐的语料库对应于非规范表达-拼写错误的表达和存储非标准化或非标准化的表达式。分析器使用知识数据库对输入短语进行形态分析,并输出分析结果。如果在词素词典中对于输入的短语不存在词素,则本发明通过使用包括在输入的短语中的非规范表达的比对语料来找到与非规范表达相对应的规范表达来进行形态分析。

著录项

  • 公开/公告号KR101509727B1

    专利类型

  • 公开/公告日2015-04-07

    原文格式PDF

  • 申请/专利权人 주식회사 시스트란인터내셔널;

    申请/专利号KR20130118062

  • 发明设计人 지창진;

    申请日2013-10-02

  • 分类号G06F17/27;G06F17/30;

  • 国家 KR

  • 入库时间 2022-08-21 14:58:24

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号