Computer-based system and method for finding the rule of law in text

Abstract

A system and method for binary classification of text units, such as sentences, paragraphs, and documents, as either a rule of law (ROL) or not a rule of law (~ROL). During the training phase of the system and method, an initialized knowledge base and labeled (pre-classified) sentences are used to build a trained knowledge base. The trained knowledge base contains an equation, a threshold, and a plurality of statistical values called Z values. When text documents are input for classification, a Z value is generated for each term or token in the input text. The Z values are input to the equation, which calculates a score for each sentence. Each calculated score is then compared to the threshold to classify the sentence as either ROL or ~ROL.
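
The train-then-classify flow described in the abstract can be sketched roughly as follows. The abstract does not disclose how Z values are computed or what the scoring equation is, so this sketch makes two assumptions that are not taken from the patent: each term's Z value is a two-proportion z-statistic comparing how often the term appears in ROL versus ~ROL training sentences, and the equation is the mean Z value over a sentence's tokens. The names `tokenize`, `train_z_values`, `score`, and `classify` are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(sentence):
    """Lowercase the sentence and split it into word tokens."""
    return re.findall(r"[a-z']+", sentence.lower())


def train_z_values(labeled):
    """Build a term -> Z value table from (sentence, is_rol) pairs.

    ASSUMPTION: Z is a two-proportion z-statistic on sentence-level term
    presence; the patent abstract does not specify the statistic.
    """
    rol_counts, other_counts = Counter(), Counter()
    n_rol = n_other = 0
    for sentence, is_rol in labeled:
        terms = set(tokenize(sentence))  # count each term once per sentence
        if is_rol:
            n_rol += 1
            rol_counts.update(terms)
        else:
            n_other += 1
            other_counts.update(terms)
    if n_rol == 0 or n_other == 0:
        raise ValueError("need at least one ROL and one ~ROL training sentence")

    z_values = {}
    for term in set(rol_counts) | set(other_counts):
        p_rol = rol_counts[term] / n_rol
        p_other = other_counts[term] / n_other
        pooled = (rol_counts[term] + other_counts[term]) / (n_rol + n_other)
        std_err = math.sqrt(pooled * (1 - pooled) * (1 / n_rol + 1 / n_other))
        # A term present in every sentence carries no signal: Z = 0.
        z_values[term] = (p_rol - p_other) / std_err if std_err > 0 else 0.0
    return z_values


def score(sentence, z_values):
    """ASSUMED equation: mean Z value of the tokens (unseen terms count as 0)."""
    tokens = tokenize(sentence)
    if not tokens:
        return 0.0
    return sum(z_values.get(t, 0.0) for t in tokens) / len(tokens)


def classify(sentence, z_values, threshold):
    """Compare the sentence score to the threshold, as the abstract describes."""
    return "ROL" if score(sentence, z_values) > threshold else "~ROL"
```

In this sketch the "trained knowledge base" corresponds to the `z_values` table plus the chosen `threshold`; in practice the threshold would be tuned on held-out labeled sentences rather than fixed by hand.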