FastSeq: Make Sequence Generation Faster

Abstract

Transformer-based models have had a tremendous impact on natural language generation. However, inference speed is a bottleneck because of large model sizes and the intensive computation involved in the auto-regressive decoding process. We develop the FastSeq framework to accelerate sequence generation without accuracy loss. The proposed optimization techniques include an attention cache optimization, an efficient algorithm for detecting repeated n-grams, and an asynchronous generation pipeline with parallel I/O. These optimizations are general enough to apply to Transformer-based models (e.g., T5, GPT2, and UniLM). Our benchmark results on a set of widely used and diverse models demonstrate a 4-9x inference speed gain. Additionally, FastSeq is easy to use, requiring only a simple one-line code change.
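
The abstract names repeated n-gram detection as one of the optimized steps but does not describe the algorithm. As a rough illustration of the task only (not FastSeq's implementation), the sketch below finds which next tokens would complete an n-gram that already occurs in the generated sequence, so a decoder could block them. The function name `banned_next_tokens` and the plain list-of-integers token representation are assumptions made for this example.

```python
from collections import defaultdict
from typing import Dict, List, Set, Tuple


def banned_next_tokens(tokens: List[int], n: int) -> Set[int]:
    """Return the tokens that would repeat an n-gram already in `tokens`.

    Hypothetical helper for illustration; a straightforward baseline,
    not the optimized algorithm from the paper.
    """
    if n < 1 or len(tokens) < n:
        return set()

    # Map every (n-1)-token prefix to the set of tokens that followed it.
    followers: Dict[Tuple[int, ...], Set[int]] = defaultdict(set)
    for i in range(len(tokens) - n + 1):
        prefix = tuple(tokens[i:i + n - 1])
        followers[prefix].add(tokens[i + n - 1])

    # The last n-1 generated tokens are the prefix of the next n-gram.
    current_prefix = tuple(tokens[len(tokens) - n + 1:])
    return followers.get(current_prefix, set())


if __name__ == "__main__":
    # The sequence ends in (1, 2), which was previously followed by 3.
    print(banned_next_tokens([1, 2, 3, 1, 2], n=3))
```

This baseline rescans the whole sequence at every decoding step, which is the kind of per-step overhead an optimized detection algorithm would aim to reduce.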

Bibliographic details

Source
Annual Meeting of the Association for Computational Linguistics; International Joint Conference on Natural Language Processing | 2021 | pp. 218-226 | 9 pages
Authors
Yu Yan; Fei Hu; Jiusheng Chen; Nikhil Bhendawade; Ting Ye; Yeyun Gong; Nan Duan; Desheng Cui; Bingyu Chi; Ruifei Zhang
Original format: PDF
