首页> 外文会议>International Conference on Informatics and Computing >Meme Opinion Categorization by Using Optical Character Recognition (OCR) and Na?ve Bayes Algorithm
【24h】

Meme Opinion Categorization by Using Optical Character Recognition (OCR) and Na?ve Bayes Algorithm

机译:使用光学字符识别(OCR)和NA ve贝雷斯算法的MEME意见分类

获取原文

摘要

Generally, a meme is an image which is produced by the society which is used to comment a certain event, followed with a particular template from the decent online images. The distribution of meme becomes the phenomenon and very popular in the last few years. The problem is, some of these memes contain negative contents to harm others. One type of meme image that trends in social media is aimed at government, either to support the performance of the government or to insinuate and dislike of a government. Therefore, Political view by the citizen can be identified by viral memes on the internet. The aim of this research is classifying the types of a meme by applying image processing and OCR Tesseract which are combined with Na?ve Bayes Algorithm. OCR Tesseract is required to recognize text in an image, meanwhile Naive Bayes algorithm which is used to find the highest probability to classify the testing dataset into the correct category. This research uses a meme as the dataset. The result is meme which is successfully classified. The accuracy depends on the OCR result which utilizes tesseract engine.
机译:通常,MEME是由社会产生的图像,该社会用于评论某个事件,其次是来自体面在线图像的特定模板。 MEME的分布成为过去几年中的现象和非常流行。问题是,其中一些模因包含负面内容来伤害他人。一种类型的MEME图像,即社会媒体的趋势旨在支持政府的表现或暗示和不喜欢政府。因此,可以通过互联网上的病毒模因来确定公民的政治观点。本研究的目的是通过应用与Na ve贝叶斯算法组合的图像处理和OCR TESERACT来分类MEME的类型。 ocr tesseract是识别图像中的文本所必需的,同时朴素的贝叶斯算法用于找到将测试数据集分类为正确类别的最高概率。该研究使用MEME作为数据集。结果是成功分类的MEME。准确性取决于利用TESSERACT发动机的OCR结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号