Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT)

Towards a Comprehensive Understanding and Accurate Evaluation of Societal Biases in Pre-Trained Transformers

Abstract

The ease of access to pre-trained transformers has enabled developers to leverage large-scale language models to build exciting applications for their users. While such pre-trained models offer convenient starting points for researchers and developers, there is little consideration for the societal biases captured within these models, risking the perpetuation of racial, gender, and other harmful biases when these models are deployed at scale. In this paper, we investigate gender and racial bias across ubiquitous pre-trained language models, including GPT-2, XLNet, BERT, RoBERTa, ALBERT, and DistilBERT. We evaluate bias within pre-trained transformers using three metrics: WEAT, sequence likelihood, and pronoun ranking. We conclude with an experiment demonstrating the ineffectiveness of word-embedding techniques, such as WEAT, signaling the need for more robust bias testing in transformers.
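The first of these metrics, WEAT (the Word Embedding Association Test of Caliskan et al., 2017), measures the differential association between two sets of target words and two sets of attribute words in embedding space. A minimal sketch of the effect size follows, assuming embeddings have already been extracted as NumPy vectors; the function names and inputs are illustrative, not the authors' implementation.

```python
import numpy as np

def cos(u, v):
    # Cosine similarity between two embedding vectors.
    return u @ v / (np.linalg.norm(u) * np.linalg.norm(v))

def association(w, A, B):
    # s(w, A, B): mean similarity to attribute set A minus attribute set B.
    return np.mean([cos(w, a) for a in A]) - np.mean([cos(w, b) for b in B])

def weat_effect_size(X, Y, A, B):
    # Effect size d (Caliskan et al., 2017): difference in mean target-word
    # associations, normalized by the standard deviation over all targets.
    sx = [association(x, A, B) for x in X]
    sy = [association(y, A, B) for y in Y]
    return (np.mean(sx) - np.mean(sy)) / np.std(sx + sy, ddof=1)

# Example: X, Y hold embeddings for, e.g., male vs. female names, and
# A, B for career vs. family words; |d| near 0 indicates little measured bias.
```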
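Sequence likelihood, the second metric, compares the probability an autoregressive model assigns to sentence variants that differ only in a demographic term. A hedged sketch using GPT-2 via the Hugging Face transformers library is below; the probe sentences are hypothetical examples, not the paper's actual templates.

```python
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tok = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()

def sequence_log_likelihood(text):
    # Total log-likelihood of `text` under GPT-2.
    ids = tok(text, return_tensors="pt").input_ids
    with torch.no_grad():
        out = model(ids, labels=ids)
    # out.loss is the mean negative log-likelihood per predicted token;
    # labels are shifted internally, so ids.size(1) - 1 tokens are scored.
    return -out.loss.item() * (ids.size(1) - 1)

# Hypothetical probe pair: a systematic likelihood gap across many such
# templates suggests a gendered association.
print(sequence_log_likelihood("He is a doctor."))
print(sequence_log_likelihood("She is a doctor."))
```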

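Pronoun ranking, the third metric, can be probed with a masked language model by masking the pronoun slot and comparing the probabilities assigned to candidate pronouns. A minimal sketch using the transformers fill-mask pipeline with bert-base-uncased follows; the sentence is a hypothetical probe, not taken from the paper.

```python
from transformers import pipeline

unmasker = pipeline("fill-mask", model="bert-base-uncased")

# Restrict predictions to the candidate pronouns via `targets` and
# compare the probabilities BERT assigns to each.
results = unmasker("The nurse said that [MASK] would be late.",
                   targets=["he", "she"])
for r in results:
    print(f"{r['token_str']}: {r['score']:.4f}")
```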