IEEE International Conference on Power Electronics, Computer Applications

A fine-tuning approach research of pre-trained model with two stage



Abstract

A fine-tuning method was introduced with BERT, a pre-trained model widely used in NLP. BERT and GPT hold that a standard fine-tuning model should keep the difference between the pre-trained architecture and the final downstream architecture minimal, and that a task-specific model will harm the result. In this paper, we propose a two-stream model that uses the hidden states pre-trained in BERT. To make the method easy to validate, we verify the results on sentiment analysis, a very simple text classification task in natural language processing. Experiments on Yelp-review-polarity show that, with the same training data as other fine-tuning methods, we can reduce the error rate by 0.21%. With the same setup, we reduce the error rate on Amazon-review-polarity by 0.13%.
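
To make the fine-tuning setup concrete, the sketch below fine-tunes a pre-trained BERT encoder for binary sentiment polarity classification (as in the Yelp and Amazon review-polarity datasets) by adding only a small linear head on top of the pre-trained hidden states, which keeps the downstream architecture close to the pre-trained one as the abstract advocates. This is a minimal baseline assuming the Hugging Face transformers library, not the paper's exact two-stream architecture (which the abstract does not detail); the class name SentimentClassifier and all hyperparameters are illustrative.

import torch
import torch.nn as nn
from transformers import AutoModel, AutoTokenizer

class SentimentClassifier(nn.Module):
    # Illustrative fine-tuning baseline, not the paper's two-stream model.
    def __init__(self, encoder_name="bert-base-uncased", num_labels=2):
        super().__init__()
        self.encoder = AutoModel.from_pretrained(encoder_name)
        hidden = self.encoder.config.hidden_size
        # Small task head over the pre-trained hidden states; the rest of
        # the architecture is unchanged from pre-training.
        self.head = nn.Linear(hidden, num_labels)

    def forward(self, input_ids, attention_mask):
        out = self.encoder(input_ids=input_ids, attention_mask=attention_mask)
        # Use the hidden state of the [CLS] token as the sequence representation.
        cls_state = out.last_hidden_state[:, 0]
        return self.head(cls_state)

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = SentimentClassifier()
batch = tokenizer(["great food", "terrible service"],
                  padding=True, truncation=True, return_tensors="pt")
logits = model(batch["input_ids"], batch["attention_mask"])
loss = nn.CrossEntropyLoss()(logits, torch.tensor([1, 0]))
loss.backward()  # encoder and head are fine-tuned jointly

In practice the encoder and head are trained end-to-end with a small learning rate, so the pre-trained weights are only slightly adjusted for the polarity task.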

