IEEE International Conference on Acoustics, Speech and Signal Processing

ELBERT: Fast ALBERT with Confidence-Window Based Early Exit



Abstract

Despite their great success in Natural Language Processing (NLP), large pre-trained language models like BERT are not well suited for resource-constrained or real-time applications owing to their large number of parameters and slow inference speed. Recently, compressing and accelerating BERT have become important topics. By incorporating a parameter-sharing strategy, ALBERT greatly reduces the number of parameters while achieving competitive performance. Nevertheless, ALBERT still suffers from long inference time. In this work, we propose ELBERT, which significantly improves the average inference speed over ALBERT through the proposed confidence-window based early exit mechanism, without introducing additional parameters or extra training overhead. Experimental results show that ELBERT achieves an adaptive inference speedup varying from 2× to 10× with negligible accuracy degradation compared to ALBERT on various datasets. Moreover, ELBERT achieves higher accuracy than existing early exit methods used for accelerating BERT under the same computation cost. Furthermore, to understand the principle of the early exit mechanism, we also visualize its decision-making process in ELBERT. Our code is publicly available online.
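The abstract's confidence-window criterion can be illustrated with a minimal sketch: run an intermediate classifier after each transformer layer and exit as soon as the prediction confidence stays above a threshold for a window of consecutive layers. The `threshold` and `window` values below are illustrative assumptions, not the paper's actual hyperparameters, and the per-layer logits stand in for real intermediate classifier outputs.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def early_exit(per_layer_logits, threshold=0.9, window=2):
    """Return (prediction, exit_layer).

    Exit as soon as the max softmax confidence has stayed at or above
    `threshold` for `window` consecutive layers; otherwise fall through
    to the final layer's prediction.
    """
    streak = 0
    for layer, logits in enumerate(per_layer_logits):
        probs = softmax(logits)
        conf = max(probs)
        streak = streak + 1 if conf >= threshold else 0
        if streak >= window:
            return probs.index(conf), layer
    # No early exit triggered: use the final layer.
    probs = softmax(per_layer_logits[-1])
    return probs.index(max(probs)), len(per_layer_logits) - 1

# Confidence rises across layers, so inference stops before the last layer.
layers = [[1.0, 1.1], [0.5, 3.0], [0.2, 4.0], [0.1, 5.0]]
print(early_exit(layers, threshold=0.9, window=2))  # → (1, 2)
```

Requiring the confidence to persist over a window, rather than checking a single layer, guards against exiting on a momentary confidence spike from one intermediate classifier.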
