Improving historical spelling normalization with bi-directional LSTMs and multi-task learning


Abstract

Natural-language processing of historical documents is complicated by the abundance of variant spellings and lack of annotated data. A common approach is to normalize the spelling of historical words to modern forms. We explore the suitability of a deep neural network architecture for this task, particularly a deep bi-LSTM network applied on a character level. Our model compares well to previously established normalization algorithms when evaluated on a diverse set of texts from Early New High German. We show that multi-task learning with additional normalization data can improve our model's performance further.
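The abstract does not say how word pairs are encoded for the character-level network. One plausible framing, assumed here purely for illustration, is sequence labeling: each historical character is labeled with the modern substring it maps to, so that a tagger such as a bi-LSTM predicts one label per input character. The sketch below builds such labels using Python's `difflib` as a stand-in aligner; the function name `char_labels` and the example word pairs are hypothetical and are not taken from the paper.

```python
from difflib import SequenceMatcher


def char_labels(historical: str, modern: str) -> list[str]:
    """Label each historical character with the modern substring it maps to.

    Illustrative only: difflib is a stand-in aligner, not the authors' method.
    A label may be empty (character deleted) or longer than one character.
    """
    if not historical:
        raise ValueError("historical word must be non-empty")

    labels = [""] * len(historical)
    pending = ""  # modern characters inserted before the first historical one
    matcher = SequenceMatcher(a=historical, b=modern, autojunk=False)

    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            # Unchanged characters label themselves.
            for k in range(i2 - i1):
                labels[i1 + k] = modern[j1 + k]
        elif tag == "replace":
            # Pair characters one-to-one; the last source character
            # absorbs whatever remains of the target substring.
            target = modern[j1:j2]
            src_len = i2 - i1
            for k in range(src_len - 1):
                labels[i1 + k] = target[k:k + 1]
            labels[i2 - 1] = target[src_len - 1:]
        elif tag == "insert":
            # Attach inserted characters to the preceding source character.
            if i1 > 0:
                labels[i1 - 1] += modern[j1:j2]
            else:
                pending += modern[j1:j2]
        # "delete": the affected labels stay empty.

    labels[0] = pending + labels[0]
    return labels


print(char_labels("vnd", "und"))  # ['u', 'n', 'd']
```

Concatenating the labels always reproduces the modern form, so a model trained on these labels can emit a normalization by joining its per-character predictions.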


Bibliographic details

Source: International Conference on Computational Linguistics, 2016, pp. 131-139 (9 pages)
Authors: Marcel Bollmann; Anders Søgaard
Original format: PDF

Similar literature


1. A bi-directional missing data imputation scheme based on LSTM and transfer learning for building energy data [J]. Ma Jun, Cheng Jack C. P., Jiang Feifeng. Energy and Buildings. 2020, No. Juna
2. Embedded Bi-directional GRU and LSTM Learning Models to Predict Disasters on Twitter Data [J]. A. Bhuvaneswari, J. Timothy Jones Thomas, P. Kesavan. Procedia Computer Science. 2019, No. 5
3. Learning to Monitor Machine Health with Convolutional Bi-Directional LSTM Networks [J]. Rui Zhao, Ruqiang Yan, Jinjiang Wang. Sensors. 2017, No. 2
4. Improving historical spelling normalization with bi-directional LSTMs and multi-task learning [C]. Marcel Bollmann, Anders Søgaard. International Conference on Computational Linguistics. 2016
5. Deep Learning-Based Hosting Capacity Analysis in LV Distribution Grids with Spatial-Temporal LSTMs [D]. Wu, Jiaqi. 2021
6. Deep learning for named entity recognition on Chinese electronic medical records: Combining deep transfer learning with multitask bi-directional LSTM RNN [O]. Xishuang Dong, Shanta Chowdhury, Lijun Qian. 2015
7. Bi-directional LSTM-CNNs-CRF for Italian Sequence Labeling and Multi-Task Learning [O]. Pierpaolo Basile, Pierluigi Cassotti, Lucia Siciliani. 2017
