IEEE Automatic Speech Recognition and Understanding Workshop

Knowledge Distillation from BERT in Pre-Training and Fine-Tuning for Polyphone Disambiguation

Abstract

Polyphone disambiguation aims to select the correct pronunciation for a polyphonic word from several candidates, which is important for text-to-speech synthesis. Since the pronunciation of a polyphonic word is usually decided by its context, polyphone disambiguation can be regarded as a language understanding task. Inspired by the success of BERT for language understanding, we propose to leverage pre-trained BERT models for polyphone disambiguation. However, BERT models are usually too heavy to be served online, in terms of both memory cost and inference speed. In this work, we focus on an efficient model for polyphone disambiguation and propose a two-stage knowledge distillation method that transfers the knowledge from a heavy BERT model to a lightweight BERT model in both the pre-training and fine-tuning stages, in order to reduce online serving cost. Experiments on Chinese and English polyphone disambiguation datasets demonstrate that our method reduces model parameters by a factor of 5 and improves inference speed by 7 times, while nearly matching the classification accuracy of the original BERT model (95.4% on Chinese and 98.1% on English).
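
The abstract names the two-stage distillation only at a high level. As a rough illustration of what the fine-tuning-stage objective in such a setup commonly looks like, below is a minimal PyTorch-style sketch assuming the standard soft-target formulation: KL divergence between the student's and the teacher's softened class distributions, combined with cross-entropy against the gold pronunciation labels. The function name, temperature, and weighting alpha are illustrative assumptions, not details taken from the paper.

```python
# Minimal sketch of a fine-tuning-stage distillation loss (soft targets + hard labels).
# Hyperparameters (temperature, alpha) are placeholders, not values from the paper.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits: torch.Tensor,
                      teacher_logits: torch.Tensor,
                      labels: torch.Tensor,
                      temperature: float = 2.0,
                      alpha: float = 0.5) -> torch.Tensor:
    # Soft targets: pull the student's distribution toward the teacher's,
    # with both distributions softened by the temperature.
    soft_loss = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * (temperature ** 2)
    # Hard targets: ordinary cross-entropy against the gold pronunciation labels.
    hard_loss = F.cross_entropy(student_logits, labels)
    return alpha * soft_loss + (1.0 - alpha) * hard_loss
```

At serving time only the lightweight student model is kept, which is where the 5x parameter reduction and 7x inference speedup reported in the abstract come from.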
