Named Entity Recognition on Arabic-English Code-Mixed Data

Abstract

As a result of globalization and better quality of education, a significant percentage of the population in Arab countries has become bilingual or multilingual. This has increased the frequency of code-switching and code-mixing among Arabs in daily communication. Consequently, a huge amount of Code-Mixed (CM) content can be found on different social media platforms. Such data could be analyzed and used in different Natural Language Processing (NLP) tasks to tackle the challenges emerging from this multilingual phenomenon. Named Entity Recognition (NER), the process of identifying named entities in text, is one of the major tasks in several NLP systems. However, there is a lack of annotated CM data and resources for this task. This work aims at collecting and building the first annotated CM Arabic-English corpus for NER. Furthermore, we constructed a baseline NER system using deep neural networks and word embeddings for Arabic-English CM text and enhanced it using a pooling technique.
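
To illustrate what token-level NER output on code-mixed text can look like, here is a minimal Python sketch. It is not the authors' system: the example sentence, the BIO tag scheme, the PER/LOC labels, and the helper names `is_arabic` and `decode_bio` are all assumptions made for illustration, since the abstract does not describe the annotation scheme.

```python
from typing import List, Tuple


def is_arabic(token: str) -> bool:
    """Return True if any character lies in the basic Arabic Unicode block."""
    return any("\u0600" <= ch <= "\u06FF" for ch in token)


def decode_bio(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Collapse per-token BIO tags into (entity_text, entity_type) spans."""
    spans: List[Tuple[str, str]] = []
    current: List[str] = []
    label = None
    for tok, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            # A new entity starts; close any entity still open.
            if current:
                spans.append((" ".join(current), label))
            current, label = [tok], tag[2:]
        elif tag.startswith("I-") and current and tag[2:] == label:
            # Continuation of the currently open entity.
            current.append(tok)
        else:
            # "O" tag, or an I- tag with no matching open entity
            # (treated as outside any entity in this sketch).
            if current:
                spans.append((" ".join(current), label))
            current, label = [], None
    if current:
        spans.append((" ".join(current), label))
    return spans


# Hypothetical code-mixed sentence and hypothetical gold tags.
tokens = ["أنا", "رايح", "New", "York", "مع", "Sara"]
tags = ["O", "O", "B-LOC", "I-LOC", "O", "B-PER"]

entities = decode_bio(tokens, tags)
scripts = ["ar" if is_arabic(t) else "en" for t in tokens]
print(entities)
print(scripts)
```

The script check shows why such text is challenging: entities may appear in either language within one sentence, so a tagger cannot rely on a single-language vocabulary.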

Bibliographic record

Source
IEEE International Conference on Semantic Computing | 2019 | pp. 93-97 | 5 pages
Authors
Caroline Sabty; Mohamed Elmahdy; Slim Abdennadher
Original format
PDF
Keywords
Task analysis; Hidden Markov models; Natural language processing; Twitter; Neural networks; Support vector machines
Date indexed
2022-08-26 13:53:16
