首页> 外国专利> METHOD AND DEVICE FOR CLASSIFYING UNIFORM RESOURCE LOCATORS BASED ON CONTENT IN CORRESPONDING WEBSITES

METHOD AND DEVICE FOR CLASSIFYING UNIFORM RESOURCE LOCATORS BASED ON CONTENT IN CORRESPONDING WEBSITES

机译:基于对应网站内容的统一资源定位器的方法和装置

摘要

A method and device for classifying uniform resource locators based on content in corresponding websites is disclosed. The method includes extracting, by a network device, a plurality of website contents from a website associated with a URL based on Optical Character Recognition (OCR). The method further includes classifying, by the network device, each of the plurality of website contents into a plurality of webpage categories based on machine learning. The method includes simulating, by the network device, user actions for the plurality of website contents, based on a webpage category associated with each of the plurality of website contents. The method further includes determining, by the network device, an access classification for the URL based on results of simulating the user actions and machine learning.
机译:公开了一种基于相应网站中的内容对统一资源定位符进行分类的方法和装置。该方法包括由网络设备基于光学字符识别(OCR)从与URL相关联的网站中提取多个网站内容。该方法进一步包括由网络设备基于机器学习将多个网站内容中的每一个分类为多个网页类别。该方法包括由网络设备基于与多个网站内容中的每一个相关联的网页类别来模拟针对多个网站内容的用户动作。该方法还包括由网络设备基于模拟用户动作和机器学习的结果来确定URL的访问分类。

著录项

  • 公开/公告号EP3561708A1

    专利类型

  • 公开/公告日2019-10-30

    原文格式PDF

  • 申请/专利权人 WIPRO LIMITED;

    申请/专利号EP20180180747

  • 发明设计人 GOVARDHAN SRIDHAR;VARKEY SUNIL;

    申请日2018-06-29

  • 分类号G06F21/55;G06F17/30;G06F21/57;G06N99;H04L29/06;G06N3/10;G06K9;

  • 国家 EP

  • 入库时间 2022-08-21 12:26:57

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号