ACORT: A compact object relation transformer for parameter efficient image captioning

Tan Jia Huei; Tan Ying Hua; Chan Chee SengChuah Joon Huang

首页> 外文期刊>Neurocomputing >ACORT: A compact object relation transformer for parameter efficient image captioning

【24h】

ACORT: A compact object relation transformer for parameter efficient image captioning

机译：ACORT: A compact object relation transformer for parameter efficient image captioning

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相关主题

摘要

Recent research that applies Transformer-based architectures to image captioning has resulted in stateof-the-art image captioning performance, capitalising on the success of Transformers on natural language tasks. Unfortunately, though these models work well, one major flaw is their large model sizes. To this end, we present three parameter reduction methods for image captioning Transformers: Radix Encoding, cross-layer parameter sharing, and attention parameter sharing. By combining these methods, our proposed ACORT models have 3.7x to 21.6x fewer parameters than the baseline model without compromising test performance. Results on the MS-COCO dataset demonstrate that our ACORT models are competitive against baselines and SOTA approaches, with CIDEr score P126. Finally, we present qualitative results and ablation studies to demonstrate the efficacy of the proposed changes further. Code and pre-trained models are publicly available at https://github.com/jiahuei/sparse-image-captioning. (c) 2022 Published by Elsevier B.V.

著录项

来源
《Neurocomputing》 |2022年第14期|60-72|共13页
作者
Tan Jia Huei; Tan Ying Hua; Chan Chee SengChuah Joon Huang;
展开▼
作者单位

Univ Malaya, Fac Comp Sci & Informat Technol, CISiP, Kuala Lumpur 50603, Malaysia;

Univ Malaya, Fac Engn, Kuala Lumpur 50603, Malaysia;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种英语
中图分类
关键词
Image captioning; Deep network compression; Deep learning;

ACORT: A compact object relation transformer for parameter efficient image captioning

摘要

著录项

相关主题

期刊订阅