首页> 外国专利> SYSTEMS AND METHODS FOR IDENTIFICATION OF REPETITIVE LANGUAGE IN DOCUMENT USING LINGUISTIC ANALYSIS AND CORRECTION THEREOF

SYSTEMS AND METHODS FOR IDENTIFICATION OF REPETITIVE LANGUAGE IN DOCUMENT USING LINGUISTIC ANALYSIS AND CORRECTION THEREOF

机译:使用语言分析及其纠正识别文献中重复语言的系统和方法

摘要

The present disclosure relates to processing operations configured to provide a linguistic-based approach to evaluating repetition in content of an electronic document. The approach of the present disclosure is about detecting terms/words/phrases that are likely to be perceived as being repetitious by native speakers of a language rather than just identifying the occurrence of identical words or strings in a document as done by traditional language checks. Processing of the present disclosure detects and evaluates terms or phrases using positive linguistic evidence derived from evaluation of linguistic relationships between words in a string in syntactic ways. This results in more accurate and efficient determination as to whether a term is truly repetitious at the linguistic level as compared with traditional language checks. As compared with string-based evaluation, fewer flags are raised for identification of repetitive/ over-used language, but more precise/accurate identification of repetition occurs using processing of the present disclosure.
机译:本公开涉及被配置为提供基于语言的基于语言的方法来评估电子文档的内容中的重复。本公开的方法是关于检测可能被被认为的术语/单词/短语由语言母语者被认为是重复的,而不是仅仅识别通过传统语言检查所完成的文档中的相同单词或字符串的发生。处理本公开的处理使用句法方式使用串之间的单词之间的语言关系评估来检测和评估术语或短语。这导致与传统语言检查相比,更准确和更有效地确定如何在语言水平上真正重复。与基于字符串的评估相比,提出了更少的标志以识别重复/过用的语言,但是使用本公开的处理发生更准确/准确的重复识别。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号