Improving historical spelling normalization with bi-directional LSTMs and multi-task learning

机译：用双向LSTMS和多任务学习改善历史拼写规范化

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Natural-language processing of historical documents is complicated by the abundance of variant spellings and lack of annotated data. A common approach is to normalize the spelling of historical words to modern forms. We explore the suitability of a deep neural network architecture for this task, particularly a deep bi-LSTM network applied on a character level. Our model compares well to previously established normalization algorithms when evaluated on a diverse set of texts from Early New High German. We show that multi-task learning with additional normalization data can improve our model's performance further.

机译：通过丰富的变体拼写和缺乏注释数据，历史文档的自然语言处理是复杂的。一种常见的方法是将历史单词的拼写标准化为现代形式。我们探讨了对此任务的深度神经网络架构的适用性，特别是在字符级别应用的深层BI-LSTM网络。我们的模型在从早期新的高德语的各种文本上进行评估时比较良好的归一化算法。我们表明，具有额外归一化数据的多任务学习可以进一步提高模型的性能。

著录项

来源
《International conference on computational linguistics》|2016年|lxxix 673 p.|共9页
会议地点
作者
Marcel Bollmann; Anders S?gaard;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程;
关键词

相似文献

外文文献
中文文献
专利

1. A bi-directional missing data imputation scheme based on LSTM and transfer learning for building energy data [J] . Ma Jun, Cheng Jack C. P., Jiang Feifeng, Energy and Buildings . 2020,第Juna期

机译：基于LSTM的双向缺失数据估算方案，用于构建能源数据的转移学习
2. Embedded Bi-directional GRU and LSTMLearning Models to Predict Disasterson Twitter Data [J] . A. Bhuvaneswari, J. Timothy Jones Thomas, P. Kesavan Procedia Computer Science . 2019,第5期

机译：嵌入式双向GRU和LSTMLEARIAL INGNED模型，以预测灾害灾害推特数据
3. Learning to Monitor Machine Health with Convolutional Bi-Directional LSTM Networks [J] . Rui Zhao, Ruqiang Yan, Jinjiang Wang, Sensors . 2017,第2期

机译：学习使用卷积双向LSTM网络监控机器健康
4. Improving historical spelling normalization with bi-directional LSTMs and multi-task learning [C] . Marcel Bollmann, Anders Søgaard International conference on computational linguistics . 2016

机译：通过双向LSTM和多任务学习提高历史拼写标准化
5. Deep Learning-Based Hosting Capacity Analysis in LV Distribution Grids with Spatial-Temporal LSTMs [D] . Wu, Jiaqi. 2021

机译：LV分布网的基于深度学习的托管能力分析，具有空间时间LSTMS
6. Deep learning for named entity recognition on Chinese electronic medical records: Combining deep transfer learning with multitask bi-directional LSTM RNN [O] . Xishuang Dong, Shanta Chowdhury, Lijun Qian, 2015

机译：深度学习用于中国电子病历中的命名实体识别：将深度迁移学习与多任务双向LSTM RNN相结合
7. Bi-directional LSTM-CNNs-CRF for Italian Sequence Labeling and Multi-Task Learning [O] . Pierpaolo Basile, Pierluigi Cassotti, Lucia Siciliani, 2017

机译：用于意大利序列标记和多任务学习的双向LSTM-CNNS-CRF

Improving historical spelling normalization with bi-directional LSTMs and multi-task learning

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅