首页> 美国卫生研究院文献>PeerJ Computer Science >Feature-based detection of automated language models: tackling GPT-2 GPT-3 and Grover

【2h】

Feature-based detection of automated language models: tackling GPT-2 GPT-3 and Grover

机译：基于功能的自动语言模型检测：解决GPT-2GPT-3和格罗弗

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

The recent improvements of language models have drawn much attention to potential cases of use and abuse of automatically generated text. Great effort is put into the development of methods to detect machine generations among human-written text in order to avoid scenarios in which the large-scale generation of text with minimal cost and effort undermines the trust in human interaction and factual information online. While most of the current approaches rely on the availability of expensive language models, we propose a simple feature-based classifier for the detection problem, using carefully crafted features that attempt to model intrinsic differences between human and machine text. Our research contributes to the field in producing a detection method that achieves performance competitive with far more expensive methods, offering an accessible “first line-of-defense” against the abuse of language models. Furthermore, our experiments show that different sampling methods lead to different types of flaws in generated text.

机译：最近的语言模型的改进已经引起了对潜在的使用情况和自动生成的文本的潜在案例。为了避免具有最小成本和努力的大规模文本的情景，努力开发用于检测人类代代的方法，以避免具有最小成本和努力的大规模发电，破坏了在线人类互动和事实信息的信任。虽然大多数目前的方法依赖于昂贵的语言模型的可用性，但我们提出了一个简单的基于特征的分类器，用于检测问题，使用仔细制作的功能，该功能试图模拟人类和机器文本之间的内部差异。我们的研究有助于制作一种检测方法，实现具有更昂贵的方法竞争的检测方法，为滥用语言模型提供了可访问的“第一章”。此外，我们的实验表明，不同的抽样方法导致生成文本中的不同类型的缺陷。

著录项

期刊名称 PeerJ Computer Science
作者
Leon Fröhling; Arkaitz Zubiaga;
展开▼
作者单位

展开▼
年(卷),期 2021(-1),-1
年度 2021
页码 -1
总页数 23
原文格式 PDF
正文语种
中图分类
关键词

机译：基于特征的检测;语言模型;语言生成;文本分类;NLP;

相似文献

外文文献
中文文献
专利

1. Feature-based detection of automated language models: tackling GPT-2, GPT-3 and Grover [J] . Leon Fr?hling, Arkaitz Zubiaga PeerJ Computer Science . 2021,第a期

机译：基于功能的自动语言模型检测：解决GPT-2，GPT-3和格罗弗
2. Automated Cloud Detection of Satellite Imagery Using Spatial Modeler Language and ERDAS Macro Language [J] . Teerawong Laosuwan, Singthong Pattanasethanon, Worawat Sa-ngiamvibool IETE Technical Review . 2013,第3期

机译：使用空间建模器语言和ERDAS宏语言对卫星图像进行云自动检测
3. A Probabilistic Feature-Based Parsing Model for Head-Final Languages [J] . So-Young PARK, Yong-Jae KWAK, Joon-Ho LIM, IEICE Transactions on Information and Systems . 2004,第12期

机译：最终语言的基于概率特征的解析模型
4. Hello, It's GPT-2 - How Can I Help You? Towards the Use of Pretrained Language Models for Task-Oriented Dialogue Systems [C] . Pawel Budzianowski, IvanVulic Workshop on neural generation and translation . 2019

机译：您好，这是GPT-2-我该如何帮助您？在面向任务的对话系统中使用预训练的语言模型
5. Discriminative Articulatory Feature-based Pronunciation Models with Application to Spoken Term Detection [D] . Prabhavalkar, Rohit. 2013

机译：基于区分性发音特征的语音模型及其在口语检测中的应用
6. From the Cover: Automated reconstruction of ancient languages using probabilistic models of sound change [O] . Alexandre Bouchard-Côté, David Hall, Thomas L. Griffiths, 2013

机译：从封面开始：使用声音变化的概率模型自动重建古代语言
7. Feature-based detection of automated language models: tackling GPT-2, GPT-3 and Grover [O] . Leon Fröhling, Arkaitz Zubiaga 2021

机译：基于功能的自动语言模型检测：解决GPT-2，GPT-3和格罗弗
8. Sociolinguistically Informed Natural Language Processing: Automating Irony Detection. [R] . Wallace, B. C., Beaver, D. 2017

机译：社会语言学知情自然语言处理：自动化反讽检测。

Feature-based detection of automated language models: tackling GPT-2 GPT-3 and Grover

摘要

著录项

相似文献

相关主题

期刊订阅