首页> 外国专利> REAL TIME LEARNING OF TEXT CLASSIFICATION MODELS FOR FAST AND EFFICIENT LABELING OF TRAINING DATA AND CUSTOMIZATION

REAL TIME LEARNING OF TEXT CLASSIFICATION MODELS FOR FAST AND EFFICIENT LABELING OF TRAINING DATA AND CUSTOMIZATION

机译:文本分类模型的实时学习,可快速高效地分配训练数据和自定义

摘要

Techniques for real-time generation and customization of text classification models. An initial dataset of input text samples are manually assigned labels, and the labeled input text samples are tokenized and provided as training data to train machine learning classifiers for various classes or categories of the input text samples. As the machine learning classifiers train with the training data, feedback in the form of suggestions (or predictions) are provided in real time by the text classification models regarding which label(s) to assign to any input text sample(s) currently in the training data or any new input text sample(s) further provided as training data for the respective machine learning classifiers. The suggested (or predicted) label(s) can be manually assigned to the input text sample(s), if deemed appropriate, and the newly labeled input text sample(s) can be provided to supplement the existing training data for the respective machine learning classifiers.
机译:实时生成和自定义文本分类模型的技术。输入文本样本的初始数据集是手动分配的标签,标记的输入文本样本被标记化并作为训练数据来训练输入文本样本的各种类或类别的机器学习分类器。当机器学习分类器使用训练数据进行训练时,文本分类模型会实时提供建议(或预测)形式的反馈,这些反馈涉及关于将哪些标签分配给当前输入文本中的任何标签。训练数据或任何新的输入文本样本,进一步提供作为各个机器学习分类器的训练数据。如果认为合适,则可以将建议的(或预测的)标签手动分配给输入文本样本,并且可以提供新标记的输入文本样本以补充各个机器的现有训练数据。学习分类器。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号