Personal information detecting-filtering system and method for reducing load of irregular image files in homepage

Abstract

The present invention relates to a system and a method for detecting and blocking personal information in atypical image files on a homepage while reducing server load. When checking whether personal information is exposed in the atypical image files that make up a homepage, the method reduces the load on the diagnostic server by excluding image files that are too small for text extraction or that are duplicates. It also detects exposure more accurately by extracting text repeatedly while varying the rotation angle, saturation, and brightness of each image file.

The method includes: an image file collection step (S10) of collecting image files from the content; an image file processing step (S20) of checking the size of each collected image file and deleting files below a reference capacity, then generating a unique value for each remaining file and comparing it with previously stored unique values so that, for a duplicate image file, the previously stored detection result is loaded; an image file correction step (S30) of repeatedly extracting text from each non-duplicate image file while changing its rotation angle, brightness, and saturation, and combining the extracted text into a single text; and a personal information exposure determination step (S40) of detecting whether personal information is exposed in the text combined in the image file correction step (S30).
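Steps S20 to S40 can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented implementation: the abstract names no hash function, so SHA-256 stands in for the "unique value"; the 2048-byte reference capacity, the grid of rotation, brightness, and saturation values, and the two regular expressions are all made up for the example; and `ocr` is a hypothetical callback that the caller would back with a real OCR engine.

```python
import hashlib
import re
from itertools import product
from typing import Callable, Dict, Iterable, List

# Assumed reference capacity; the abstract does not state a value.
MIN_BYTES = 2048

# Hypothetical patterns; a real system would use locale-specific rules.
PII_PATTERNS = {
    "email": re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"),
    "phone": re.compile(r"\b\d{2,3}-\d{3,4}-\d{4}\b"),
}

# Hypothetical OCR callback:
# (image bytes, rotation in degrees, brightness, saturation) -> extracted text
OcrFn = Callable[[bytes, int, float, float], str]


def detect_pii(text: str) -> List[str]:
    """S40: return the names of the patterns that match the text."""
    return sorted(n for n, pat in PII_PATTERNS.items() if pat.search(text))


def extract_text(image: bytes, ocr: OcrFn) -> str:
    """S30: run OCR under several corrections and merge into one text."""
    variants = product((0, 90, 180, 270), (0.8, 1.0, 1.2), (0.8, 1.0, 1.2))
    return "\n".join(ocr(image, rot, bri, sat) for rot, bri, sat in variants)


def scan_images(
    images: Iterable[bytes], ocr: OcrFn, cache: Dict[str, List[str]]
) -> Dict[str, List[str]]:
    """S20: drop small files, reuse stored results for duplicates."""
    results: Dict[str, List[str]] = {}
    for image in images:
        if len(image) < MIN_BYTES:
            continue  # below the reference capacity, skipped without OCR
        digest = hashlib.sha256(image).hexdigest()  # the file's unique value
        if digest not in cache:
            # Only files not seen before pay for the repeated OCR passes.
            cache[digest] = detect_pii(extract_text(image, ocr))
        results[digest] = cache[digest]
    return results
```

Passing the same `cache` dictionary across scans is what lets a duplicate image reuse its earlier detection result; persisting it (for example in a database) would be a deployment choice the abstract does not specify.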
