Training a Neural Network in a Low-Resource Setting on Automatically Annotated Noisy Data

机译：在资源不足的情况下根据自动注释的噪声数据训练神经网络

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Manually labeled corpora are expensive to create and often not available for low-resource languages or domains. Automatic labeling approaches are an alternative way to obtain labeled data in a quicker and cheaper way. However, these labels often contain more errors which can deteriorate a classifier's performance when trained on this data. We propose a noise layer that is added to a neural network architecture. This allows modeling the noise and train on a combination of clean and noisy data. We show that in a low-resource NER task we can improve performance by up to 35% by using additional, noisy data and handling the noise.

机译：手动标记的语料库创建起来很昂贵，而且通常不适用于资源匮乏的语言或域。自动标记方法是一种以更快，更便宜的方式获取标记数据的替代方法。但是，这些标签通常包含更多错误，当根据该数据进行训练时，这些错误可能会降低分类器的性能。我们建议将噪声层添加到神经网络体系结构中。这样就可以对噪声进行建模，并结合干净和嘈杂的数据进行训练。我们表明，在低资源的NER任务中，我们可以通过使用其他嘈杂的数据并处理噪声来将性能提高35％。

著录项

来源
《Workshop on deep learning approaches for low-resource natural language processing 2018》|2018年|12-18|共7页
会议地点 Melbourne(AU)
作者
Michael A. Hedderich; Dietrich Klakow;
展开▼
作者单位

Spoken Language Systems (LSV),Saarbrucken Graduate School of Computer Science Saarland Informatics Campus, Saarland University, Saarbruecken, Germany;

Spoken Language Systems (LSV);

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. A hybrid approach for training recurrent neural networks: application to multi-step-ahead prediction of noisy and large data sets [J] . S. Chtourou, M. Chtourou, O. Hammami Neural computing & applications . 2008,第3期

机译：训练递归神经网络的混合方法：应用于嘈杂和大数据集的多步提前预测
2. Training neural networks with noisy data as an ill-posed problem [J] . Martin Burger, Heinz W. Engl Advances in computational mathematics . 2000,第4期

机译：用嘈杂的数据作为不适定问题来训练神经网络
3. Performance of an Artificial Neural Network Model for Simulating Saltwater Intrusion Process in Coastal Aquifers when Training with Noisy Data [J] . Rajib Kumar Bhattacharjya, Bithin Datta, Mysore G. Satish KSCE journal of civil engineering . 2009,第3期

机译：含噪声数据训练时模拟人工神经网络模型对沿海含水层咸水入侵过程的性能
4. Training a Neural Network in a Low-Resource Setting on Automatically Annotated Noisy Data [C] . Michael A. Hedderich, Dietrich Klakow Annual meeting of the Association for Computational Linguistics . 2018

机译：在自动注释的嘈杂数据上培训在低资源设置中的神经网络
5. Detection of Small Synaptic Signals in Noisy Electrophysiological Data by Means of Artificial Neural Networks [D] . Marino, Marc Joseph. 2020

机译：通过人工神经网络检测噪声电生理数据中的小突触信号
6. Deep neural networks show an equivalent and often superior performance to dermatologists in onychomycosis diagnosis: Automatic construction of onychomycosis datasets by region-based convolutional deep neural network [O] . Seung Seog Han, Gyeong Hun Park, Woohyung Lim, -1

机译：深度神经网络在灰指甲诊断中显示出与皮肤科医生相当且通常优于皮肤病的性能：通过基于区域的卷积深度神经网络自动构建灰指甲数据集
7. Training a Neural Network in a Low-Resource Setting on Automatically Annotated Noisy Data [O] . Michael A. Hedderich, Dietrich Klakow 2018

机译：在自动注释的嘈杂数据上培训在低资源设置中的神经网络

Training a Neural Network in a Low-Resource Setting on Automatically Annotated Noisy Data

摘要

著录项

相似文献

相关主题

期刊订阅