Feature text string-based sensitive text detecting method and apparatus

Abstract

A feature text string is acquired from the text currently under detection and scanned with a finite-state machine established in advance, to obtain the frequency of occurrence of each keyword in the feature text string. For each of multiple keyword categories, a weight of that category in the text is calculated from the frequency of occurrence of each keyword belonging to the category and a preset weight of each keyword. The text is determined to be sensitive when the weight of at least one keyword category is greater than a preset threshold.
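The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The abstract does not say which finite-state machine is used or how frequencies and preset weights are combined, so the sketch assumes an Aho-Corasick automaton and a category weight equal to the sum of frequency × preset weight. It also treats the input string as the already-extracted feature text string. The keyword table, category names, weights, and threshold are hypothetical.

```python
from collections import Counter, deque

def build_automaton(keywords):
    """Build an Aho-Corasick automaton (an assumed choice of finite-state machine)."""
    goto = [{}]   # goto[state][char] -> next state
    fail = [0]    # failure link of each state
    out = [[]]    # keywords recognised when a state is reached
    for word in keywords:
        state = 0
        for ch in word:
            if ch not in goto[state]:
                goto.append({})
                fail.append(0)
                out.append([])
                goto[state][ch] = len(goto) - 1
            state = goto[state][ch]
        out[state].append(word)
    # Breadth-first pass to fill in failure links and inherited outputs.
    queue = deque(goto[0].values())
    while queue:
        state = queue.popleft()
        for ch, nxt in goto[state].items():
            queue.append(nxt)
            f = fail[state]
            while f and ch not in goto[f]:
                f = fail[f]
            fail[nxt] = goto[f].get(ch, 0)
            out[nxt] = out[nxt] + out[fail[nxt]]
    return goto, fail, out

def count_keywords(feature_text, automaton):
    """Scan the text once and count every keyword occurrence."""
    goto, fail, out = automaton
    counts = Counter()
    state = 0
    for ch in feature_text:
        while state and ch not in goto[state]:
            state = fail[state]
        state = goto[state].get(ch, 0)
        counts.update(out[state])
    return counts

def category_weights(counts, keyword_table):
    """Assumed formula: category weight = sum(frequency * preset keyword weight)."""
    weights = Counter()
    for word, freq in counts.items():
        category, preset_weight = keyword_table[word]
        weights[category] += freq * preset_weight
    return dict(weights)

def is_sensitive(feature_text, keyword_table, threshold):
    """Sensitive if at least one category weight exceeds the threshold."""
    automaton = build_automaton(keyword_table)
    weights = category_weights(count_keywords(feature_text, automaton), keyword_table)
    return any(w > threshold for w in weights.values()), weights

# Hypothetical keyword table: keyword -> (category, preset weight).
KEYWORDS = {
    "free money": ("spam", 2.0),
    "lottery": ("spam", 1.5),
    "money": ("spam", 0.5),
    "password": ("phishing", 3.0),
}

flagged, weights = is_sensitive(
    "win the lottery for free money, free money now", KEYWORDS, threshold=5.0
)
print(flagged, weights)
```

In practice the automaton would be built once and reused across texts, which matches the abstract's "established in advance"; the sketch rebuilds it per call only to stay short.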

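Bibliographic details

  • Publication number: US9710455B2
  • Publication date: 2017-07-18
  • Original format: PDF
  • Applicant/Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
  • Application number: US201515110541
  • Inventor: HONGLIN ZHANG
  • Filing date: 2015-02-11
  • Classification: G10L15/183; G06F17/27; G06F17/22; G06F17/30
  • Country: US
  • Date indexed: 2022-08-21 13:45:40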
