首页> 外国专利> Out-of-domain sentence detection

Out-of-domain sentence detection

机译:域名句子检测

摘要

A computer-implemented method includes obtaining a training data set including text data indicating one or more phrases or sentences. The computer-implemented method includes training a classifier using supervised machine learning based on the training data set and additional text data indicating one or more out-of-domain phrases or sentences. The computer-implemented method includes training an autoencoder using unsupervised machine learning based on the training data. The computer-implemented method further includes combining the classifier and the autoencoder to generate the out-of-domain sentence detector configured to generate an output indicating a classification of whether input text data corresponds to an out-of-domain sentence. The output is based on a combination of a first output of the classifier and a second output of the autoencoder.
机译:计算机实现的方法包括获得包括指示一个或多个短语或句子的文本数据的训练数据集。计算机实现的方法包括基于训练数据集的使用监督机器学习和指示一个或多个域外短语或句子的附加文本数据来训练分类器。计算机实现的方法包括使用基于训练数据的无监督机器学习培训AutoEncoder。计算机实现的方法还包括组合分类器和AutoEncoder以生成域名句子检测器,被配置为生成指示输入文本数据是否对应于域名句子的分类的输出。输出基于分类器的第一输出和AutoEncoder的第二输出的组合。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号