JUNK TEXT IDENTIFICATION METHOD AND DEVICE, COMPUTING DEVICE AND READABLE STORAGE MEDIUM

Abstract

The present invention discloses a junk text identification method and device, a computing device, and a readable storage medium. One method embodiment comprises the steps of: dividing a text to be recognized to obtain a division result; generating a feature vector for the text to be recognized on the basis of the division result; inputting the feature vector into a plurality of first classification models to obtain outputs of the plurality of first classification models, the first classification models comprising a linear classification model and a deep learning classification model; combining at least the outputs of the plurality of first classification models to obtain a combined vector; and using a second classification model to determine, according to the combined vector, whether the text to be recognized is junk text. The present invention also discloses a corresponding junk text identification device, computing device, and readable storage medium.
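
The steps in the abstract resemble a two-level stacked classifier, and the following minimal Python sketch illustrates that flow under stated assumptions. It is not the patented implementation: the function names (`segment`, `featurize`, `linear_model`, `tiny_mlp`, `is_junk`), the vocabulary, and all weights are hypothetical and hand-set for illustration. Whitespace splitting stands in for a real text divider, and a one-hidden-layer network stands in for the deep learning classification model.

```python
import math
from typing import Callable, List, Sequence

Vector = List[float]
Model = Callable[[Vector], float]


def segment(text: str) -> List[str]:
    """Divide the text into tokens (whitespace split as a stand-in segmenter)."""
    return text.lower().split()


def featurize(tokens: Sequence[str], vocab: Sequence[str]) -> Vector:
    """Build a bag-of-words count vector over a fixed vocabulary."""
    return [float(tokens.count(word)) for word in vocab]


def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))


def linear_model(weights: Vector, bias: float) -> Model:
    """Logistic-regression-style linear classifier returning a score in (0, 1)."""
    def predict(x: Vector) -> float:
        return sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)
    return predict


def tiny_mlp(hidden: List[Vector], out_weights: Vector, out_bias: float) -> Model:
    """One-hidden-layer ReLU network, standing in for a deep learning classifier."""
    def predict(x: Vector) -> float:
        h = [max(0.0, sum(w * xi for w, xi in zip(row, x))) for row in hidden]
        return sigmoid(sum(w * hi for w, hi in zip(out_weights, h)) + out_bias)
    return predict


def is_junk(text: str, vocab: Sequence[str], first_models: Sequence[Model],
            second_model: Model, threshold: float = 0.5) -> bool:
    tokens = segment(text)                       # division result
    x = featurize(tokens, vocab)                 # feature vector
    combined = [m(x) for m in first_models]      # combined vector of first-level outputs
    return second_model(combined) >= threshold   # second-level decision


# Hypothetical vocabulary and hand-set weights, for illustration only.
VOCAB = ["free", "prize", "win", "meeting"]
FIRST_MODELS = [
    linear_model([1.5, 1.5, 1.0, -2.0], bias=-1.0),
    tiny_mlp([[1.0, 1.0, 1.0, 0.0], [0.0, 0.0, 0.0, 1.0]], [1.0, -2.0], out_bias=-1.0),
]
SECOND_MODEL = linear_model([2.0, 2.0], bias=-2.0)

if __name__ == "__main__":
    for sample in ("win free prize now", "meeting at noon"):
        print(sample, "->", is_junk(sample, VOCAB, FIRST_MODELS, SECOND_MODEL))
```

In practice every model would be trained on labeled data rather than hand-set. The abstract's wording "combining at least the outputs" also leaves room for appending further features to the combined vector, which this sketch does not do.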