METHOD AND DEVICE FOR AUTOMATICALLY PROOFREADING CHINESE DOCUMENT

Abstract

PROBLEM TO BE SOLVED: To automatically detect and correct misused and missing characters in a Chinese document.

SOLUTION: A character-to-reading conversion part 200 converts the input source document into a reading symbol string. A candidate word detection part 300 cuts a syllable out of the reading symbol string and uses it as a retrieval key to detect possible candidate words and their related information. A similar candidate word detection part 400 masks the similar bits of the reading symbol string with a mask means and uses the masked string as a retrieval key to detect further candidate words and their related information. An optimum candidate character string determination part 500 links the candidate words, using each candidate word's start and end positions in the source document as retrieval keys, to generate a directed network, and extracts the optimum path by dynamic programming, taking as the criterion the maximum cumulative value of use frequency plus word length weight plus source document similarity weight plus meaning similarity weight. A matching part 600 matches the character string of the optimum path against the source document to detect and mark the characters that differ.
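
The pipeline in the abstract (reading conversion, candidate lookup by reading, best path through a word network, matching against the source) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the toy `READING` and `LEXICON` tables, the toneless pinyin readings, the weights `W_LEN` and `W_SIM`, and all function names. The sketch leaves out the similar-bit masking of part 400, the meaning similarity weight, and missing-character handling, so it only covers same-length substitutions.

```python
from collections import defaultdict

# Toy data invented for this sketch (toneless pinyin readings).
READING = {"我": "wo", "门": "men", "们": "men", "现": "xian",
           "再": "zai", "在": "zai", "回": "hui", "家": "jia"}

# Lexicon keyed by reading sequence -> [(word, use frequency)].
LEXICON = {
    ("wo", "men"): [("我们", 9)],
    ("xian", "zai"): [("现在", 9)],
    ("hui", "jia"): [("回家", 8)],
    ("wo",): [("我", 3)],
    ("men",): [("门", 2), ("们", 1)],
    ("xian",): [("现", 1)],
    ("zai",): [("在", 3), ("再", 2)],
    ("hui",): [("回", 2)],
    ("jia",): [("家", 2)],
}

W_LEN = 2   # hypothetical word length weight, per character
W_SIM = 1   # hypothetical source similarity weight, per matching character
MAX_WORD_LEN = max(len(key) for key in LEXICON)


def to_readings(text):
    """Character-to-reading conversion (the role of part 200)."""
    return [READING[ch] for ch in text]


def candidates(text):
    """Look up candidate words by reading (the role of part 300).

    Returns edges of the word network grouped by end position:
    end -> [(start, word, score)].
    """
    readings = to_readings(text)
    edges = defaultdict(list)
    for start in range(len(readings)):
        limit = min(start + MAX_WORD_LEN, len(readings))
        for end in range(start + 1, limit + 1):
            key = tuple(readings[start:end])
            for word, freq in LEXICON.get(key, []):
                same = sum(a == b for a, b in zip(word, text[start:end]))
                score = freq + W_LEN * len(word) + W_SIM * same
                edges[end].append((start, word, score))
    return edges


def best_path(text):
    """Dynamic programming over the network (the role of part 500)."""
    edges = candidates(text)
    n = len(text)
    best = [float("-inf")] * (n + 1)
    back = [None] * (n + 1)
    best[0] = 0
    for end in range(1, n + 1):
        for start, word, score in edges[end]:
            if best[start] + score > best[end]:
                best[end] = best[start] + score
                back[end] = (start, word)
    if n and back[n] is None:
        raise ValueError("no path covers the whole input")
    words, pos = [], n
    while pos > 0:
        pos, word = back[pos]
        words.append(word)
    return "".join(reversed(words))


def mark_differences(source, corrected):
    """Match the best path against the source (the role of part 600)."""
    return [(i, s, c) for i, (s, c) in enumerate(zip(source, corrected))
            if s != c]


source = "我门现再回家"
corrected = best_path(source)
print(corrected, mark_differences(source, corrected))
```

With this toy data the two-character words outscore any split into single characters, so the homophone errors 门 and 再 are replaced by 们 and 在 and reported with their positions. In a fuller version, the masked-reading lookup of part 400 would add extra edges to the same network before the dynamic programming step.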

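Bibliographic data

  • Publication number: JPH10269204A
  • Publication date: 1998-10-09
  • Original format: PDF
  • Applicant/patentee: MATSUSHITA ELECTRIC IND CO LTD
  • Application number: JP19970077354
  • Inventor: KAKU SHUNKITSU
  • Filing date: 1997-03-28
  • Classification: G06F17/21; G06F17/27
  • Country: JP
  • Database entry time: 2022-08-22 03:05:38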
