首页> 外国专利> SYSTEMS AND METHODS FOR IMPROVING ACCURACY OF CLASSIFICATION-BASED TEXT DATA PROCESSING

SYSTEMS AND METHODS FOR IMPROVING ACCURACY OF CLASSIFICATION-BASED TEXT DATA PROCESSING

机译:用于提高基于分类的文本数据处理准确性的系统和方法

摘要

An application server for improving the data diversity of a corpus of training data for training a classifier is provided. The application server comprises one or more hardware processors configured to execute a set of instructions to: obtain a first set of text data; determining a set of phrases from the first set of text data; training the classifier using the set of phrases; classify the set of phrases using the trained classifier; and determine a level of accuracy of a classification by the trained classifier with the set of parameters. The application server may retrain the classifier on a second set of text data, or update a set of rules associated with the classifier, in response to determining that the level of accuracy is below a predetermined threshold.
机译:提供了一种用于改善用于训练分类器的训练数据语料库的数据多样性的应用服务器。该应用服务器包括一个或多个硬件处理器,该一个或多个硬件处理器被配置为执行一组指令以:获得第一组文本数据;以及根据第一组文本数据确定一组短语;使用这组短语来训练分类器;使用训练有素的分类器对词组进行分类;并由训练有素的分类器使用该组参数确定分类的准确度。响应于确定准确性水平低于预定阈值,应用服务器可以在第二组文本数据上对分类器进行再训练,或者更新与分类器相关联的一组规则。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号