首页> 外国专利> METHOD FOR RECOGNIZING CHARACTER STRING OF JAPANESE PROSAIC OR COLLOQUIAL SENTENCE AS WORD STREAM BY COMPUTER PROCESSING AND SOFTWARE RECORDING MEDIUM

METHOD FOR RECOGNIZING CHARACTER STRING OF JAPANESE PROSAIC OR COLLOQUIAL SENTENCE AS WORD STREAM BY COMPUTER PROCESSING AND SOFTWARE RECORDING MEDIUM

机译:通过计算机处理和软件记录介质识别日语专业或口语句子的特征字符串的方法

摘要

PROBLEM TO BE SOLVED: To make a computer recognize a sentence such as prose including colloquial expressions as a word. SOLUTION: A database storing a lot of sample word sets prepared on the basis of a lot of sample sentences is prepared. A subject constitutive word composing of the subject of a processing object sentence composed of KANA/ KANJI mixed character strings is extracted. The database is retrieved with the subject constitutive word as a keyword and the word set including this word is extracted as a subject related sample word set. It is retrieved whether the word included in the subject related sample word set is included in the character string of the processing object or not and when such a word is included, it is recognized as a word and breaks are inserted before and after that word.
机译:解决的问题:使计算机识别诸如口头表达作为单词的散文之类的句子。 SOLUTION:准备了一个数据库,该数据库存储了根据大量示例句子准备的大量示例单词集。提取由由假名/汉字混合字符串组成的处理对象语句的主题组成的主题组成词。以主题构成词作为关键词来检索数据库,并且提取包括该词的词集合作为主题相关样本词集合。检索包括在对象相关样本单词集中的单词是否包括在处理对象的字符串中,并且当包括该单词时,将该单词识别为单词,并在该单词之前和之后插入断点。

著录项

  • 公开/公告号JP2001051993A

    专利类型

  • 公开/公告日2001-02-23

    原文格式PDF

  • 申请/专利权人 GALA INC;

    申请/专利号JP19990229086

  • 发明设计人 KIKUKAWA AKIRA;

    申请日1999-08-13

  • 分类号G06F17/22;

  • 国家 JP

  • 入库时间 2022-08-22 01:27:34

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号