首页> 外文会议>International Conference on Informatics and Computing >Meme Opinion Categorization by Using Optical Character Recognition (OCR) and Naïve Bayes Algorithm
【24h】

Meme Opinion Categorization by Using Optical Character Recognition (OCR) and Naïve Bayes Algorithm

机译:使用光学字符识别(OCR)和朴素贝叶斯算法的Meme意见分类

获取原文

摘要

Generally, a meme is an image which is produced by the society which is used to comment a certain event, followed with a particular template from the decent online images. The distribution of meme becomes the phenomenon and very popular in the last few years. The problem is, some of these memes contain negative contents to harm others. One type of meme image that trends in social media is aimed at government, either to support the performance of the government or to insinuate and dislike of a government. Therefore, Political view by the citizen can be identified by viral memes on the internet. The aim of this research is classifying the types of a meme by applying image processing and OCR Tesseract which are combined with Naïve Bayes Algorithm. OCR Tesseract is required to recognize text in an image, meanwhile Naive Bayes algorithm which is used to find the highest probability to classify the testing dataset into the correct category. This research uses a meme as the dataset. The result is meme which is successfully classified. The accuracy depends on the OCR result which utilizes tesseract engine.
机译:通常,模因是社会产生的图像,用于评论某个事件,然后是来自体面的在线图像的特定模板。模因的分布已成为一种现象,并且在最近几年非常流行。问题是,这些模因中的某些包含负面内容以伤害其他人。社交媒体趋势的一种模因图像是针对政府的,以支持政府的绩效或暗示和不喜欢政府。因此,可以通过互联网上的病毒性模因来识别公民的政治观点。这项研究的目的是通过与朴素贝叶斯算法相结合的图像处理和OCR Tesseract对模因类型进行分类。 OCR Tesseract需要识别图像中的文本,与此同时,朴素贝叶斯算法(Naive Bayes algorithm)用于查找将测试数据集分类为正确类别的最高概率。本研究使用模因作为数据集。结果是模因,它已成功分类。精度取决于利用tesseract引擎的OCR结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号