Automatic Indonesian Image Caption Generation using CNN-LSTM Model and FEEH-ID Dataset

机译：使用CNN-LSTM模型和FEEH-ID数据集自动生成印度尼西亚语图像字幕

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Image captioning is a challenge in computer vision research. This paper extends research on automatic image captioning generation in the Indonesian dimension. Description in Indonesian sentences is generated for unlabeled images. The dataset used is FEEH-ID, this is the first Indonesian image captioning dataset. This research is crucial due to unavailability of a corpus for image captioning in Indonesian. This paper will compare the experimental results in the FEEH-ID dataset with English, Chinese and Japanese datasets using the CNN and LSTM models. The performance of the model proposed in the test set provides promising results of 50.0 for the BLEU-1 score and 23.9 for BLEU-3, which is above average of the Bleu evaluation results in other language datasets. The merging model between CNN and LSTM displays pretty good results for the FEEH-ID dataset. The experimental results will be better with a larger dataset.

机译：图像字幕是计算机视觉研究中的一个挑战。本文扩展了对印度尼西亚维自动图像字幕生成的研究。印度尼西亚文句子中的描述是针对未标记图像生成的。使用的数据集是FEEH-ID，这是第一个印度尼西亚图像字幕数据集。由于印尼语中没有用于图像字幕的语料库，因此这项研究至关重要。本文将使用CNN和LSTM模型将FEEH-ID数据集与英语，中文和日语数据集的实验结果进行比较。在测试集中提出的模型的性能为BLEU-1得分提供了50.0的有希望的结果，对于BLEU-3给出了23.9的有希望的结果，高于其他语言数据集中Bleu评估结果的平均值。 CNN和LSTM之间的合并模型对于FEEH-ID数据集显示了相当不错的结果。使用更大的数据集，实验结果将更好。

著录项

来源
《IEEE International Conference on Computational Intelligence and Virtual Environments for Measurement Systems and Applications 》|2019年|1-5|共5页
会议地点
作者
Edy Mulyanto; Esther Irawati Setiawan; Eko Mulyanto Yuniarno; Mauridhi Hery Purnomo;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Dogs; Feature extraction; Decoding; Electrical engineering; Flickr; Logic gates; Task analysis;

机译：狗;特征提取;解码;电气工程; Flickr;逻辑门;任务分析;

相似文献

外文文献
中文文献
专利

1. Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures [J] . Bernardi Raffaella, Cakici Ruket, Elliott Desmond, The Journal of Artificial Intelligence Research . 2016 ,第10期

机译：从图像自动生成描述：模型，数据集和评估措施的调查
2. Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures [J] . Bernardi Raffaella, Cakici Ruket, Elliott Desmond, The Journal of Artificial Intelligence Research . 2016 ,第Null期

机译：从图像自动生成描述：模型，数据集和评估措施的调查
3. Novel model to integrate word embeddings and syntactic trees for automatic caption generation from images [J] . Soft computing: A fusion of foundations, methodologies and applications . 2020 ,第2期

机译：从图像中集成Word Embeddings和Syntactic树的小说模型
4. Automatic Indonesian Image Caption Generation using CNN-LSTM Model and FEEH-ID Dataset [C] . Edy Mulyanto, Esther Irawati Setiawan, Eko Mulyanto Yuniarno, IEEE International Conference on Computational Intelligence and Virtual Environments for Measurement Systems and Applications . 2019

机译：使用CNN-LSTM模型和FEYH-ID数据集自动印度尼西亚图像字幕生成
5. Automatic 3D Building Model Generation by Integrating LiDAR and Aerial Images Using a Hybrid Approach. [D] . Kwak, Eunju. 2013

机译：通过使用混合方法将LiDAR和航拍图集成来自动生成3D建筑模型。
6. Automatic Detection of Obstructive Sleep Apnea Events Using a Deep CNN-LSTM Model [O] . Junming Zhang, Zhen Tang, Jinfeng Gao, 2021

机译：使用深层CNN-LSTM模型自动检测阻塞性睡眠呼吸暂停事件
7. Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures [O] . Bernardi, Raffaella, Cakici, Ruket, Elliott, Desmond, 2017

机译：从图像生成自动描述：模型概述，数据集和评估措施

Automatic Indonesian Image Caption Generation using CNN-LSTM Model and FEEH-ID Dataset

摘要

著录项

相似文献

相关主题

期刊订阅