Modeling Noisiness to Recognize Named Entities using Multitask Neural Networks on Social Media

机译：在社交媒体上使用多任务神经网络对噪声进行建模以识别命名实体

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Recognizing named entities in a document is a key task in many NLP applications. Although current state-of-the-art approaches to this task reach a high performance on clean text (e.g. newswire genres), those algorithms dramatically degrade when they are moved to noisy environments such as social media domains. We present two systems that address the challenges of processing social media data using character-level phonetics and phonology, word embeddings, and Part-of-Speech tags as features. The first model is a multitask end-to-end Bidirectional Long Short-Term Memory (BLSTM)-Conditional Random Field (CRF) network whose output layer contains two CRF classifiers. The second model uses a multitask BLSTM network as feature extractor that transfers the learning to a CRF classifier for the final prediction. Our systems outperform the current Fl scores of the state of the art on the Workshop on Noisy User-generated Text 2017 dataset by 2.45% and 3.69%, establishing a more suitable approach for social media environments.

机译：在许多NLP应用程序中，识别文档中的命名实体是一项关键任务。尽管当前用于此任务的最新方法在纯文本（例如新闻专栏类型）上具有很高的性能，但是当这些算法转移到嘈杂的环境（例如社交媒体域）时，它们的性能会大大降低。我们提供了两个系统，这些系统使用字符级语音和语音学，词嵌入和词性标记作为特征来应对处理社交媒体数据的挑战。第一个模型是多任务端到端双向长期短期记忆（BLSTM）-条件随机场（CRF）网络，其输出层包含两个CRF分类器。第二个模型使用多任务BLSTM网络作为特征提取器，将学习转移到CRF分类器中以进行最终预测。我们的系统在嘈杂的用户生成的Text 2017数据集研讨会上的当前Fl评分表现优于2.4％和3.69％，为社交媒体环境建立了更合适的方法。

著录项

来源
《Annual conference of the North American Chapter of the Association for Computational Linguistics: human language technologies》|2018年|1401-1412|共12页
会议地点
作者
Gustavo Aguilar; A. Pastor Lopez-Monroy; Fabio A. Gonzalez; Thamar Solorio;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Recognizing Nested Named Entity Based on the Neural Network Boundary Assembling Model [J] . Chen Yanping, Wu Yuefei, Qin Yongbin, IEEE intelligent systems . 2020,第1期

机译：基于神经网络边界组装模型识别嵌套命名实体
2. Recognizing Nested Named Entity Based on the Neural Network Boundary Assembling Model [J] . Chen Yanping, Wu Yuefei, Qin Yongbin, Trends in Ecology & Evolution . 2020,第1期

机译：基于神经网络边界组装模型识别嵌套命名实体
3. Named Entity Recognition Model Based on Neural Networks Using Parts of Speech Probability and Gazetteer Features [J] . Advanced Science Letters . 2017,第10期

机译：基于神经网络的名称实体识别模型，使用言语概率和凝视特征
4. Modeling Noisiness to Recognize Named Entities using Multitask Neural Networks on Social Media [C] . Gustavo Aguilar, A. Pastor Lopez-Monroy, Fabio A. Gonzalez, Annual conference of the North American Chapter of the Association for Computational Linguistics: human language technologies . 2018

机译：使用Multitask神经网络在社交媒体上识别命名实体的噪音模拟
5. Modeling and Discovering Authentic and Effective Influencers on Social Media via Graph Neural Network Learning [D] . Kim, Seungbae. 2021

机译：通过图形神经网络学习建模与发现社交媒体的真实有效的影响力
6. A multitask bi-directional RNN model for named entity recognition on Chinese electronic medical records [O] . Shanta Chowdhury, Xishuang Dong, Lijun Qian, 2018

机译：中国电子病历中命名实体识别的多任务双向RNN模型
7. F-Score Driven Max Margin Neural Network for Named Entity Recognition in Chinese Social Media [O] . He, Hangfeng, Sun, Xu 2017

机译：用于命名实体识别的F-score驱动最大边缘神经网络中国社交媒体

Modeling Noisiness to Recognize Named Entities using Multitask Neural Networks on Social Media

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅