首页> 外国专利> Method for detecting stylistic errors and generating replacement strings in a document containing Japanese text

Method for detecting stylistic errors and generating replacement strings in a document containing Japanese text

机译:在包含日语文本的文档中检测样式错误并生成替换字符串的方法

摘要

Critiques are applied to phrase units included in morpho- syntactical information derived from the Japanese text. The critiques include a "trigger" and an "action," and are written in a special- purpose syntax that allows for easy specification of the error class and the rewrite generation. If a critique's trigger condition is satisfied, the associated action is carried out in order to generate a replacement text string. The process of generating replacement text strings employs a morphological graph that reflects possible word formations. In a first pass through the graph, a breadth first search is used to identify intermediate nodes along a path whose morpheme transitions satisfy at least part of the attributes of the text. In a second pass, a depth first search is used to select only those morpheme transitions that completely satisfy the rewrite criteria specified in the critique, while traversing the nodes identified in the breadth first search. The morpheme transitions identified during the depth first search provide the text string that is used to replace the original phrase.
机译:批判应用于源自日语文本的词法句法信息中包含的短语单元。批判包括“触发”和“动作”,并以特殊目的的语法编写,可轻松指定错误类和重写生成。如果满足评论的触发条件,则执行关联的操作以生成替换文本字符串。生成替换文本字符串的过程采用反映可能的单词格式的形态图。在通过图形的第一遍中,使用广度优先搜索来识别沿词素转换至少满足文本属性部分的路径的中间节点。在第二遍中,深度优先搜索用于遍历在广度优先搜索中标识的节点时,仅选择完全满足评论中指定的重写条件的词素转换。在深度优先搜索期间识别出的语素过渡提供了用于替换原始短语的文本字符串。

著录项

  • 公开/公告号US6035269A

    专利类型

  • 公开/公告日2000-03-07

    原文格式PDF

  • 申请/专利权人 MICROSOFT CORPORATION;

    申请/专利号US19980103207

  • 发明设计人 HYUN-SUK KIM;

    申请日1998-06-23

  • 分类号G06F17/27;G06F17/28;G06F17/21;

  • 国家 US

  • 入库时间 2022-08-22 01:37:39

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号