首页> 外文会议> >Combined Classification for Extracting Named Entities from Arabic Texts

【24h】

Combined Classification for Extracting Named Entities from Arabic Texts

机译：从阿拉伯语文本中提取命名实体的组合分类

获取原文

获取原文并翻译 | 示例

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In this paper, we describe an approach for extracting named entities from Arabic texts. Arabic language is hard to process since its characteristics that influence, even, the NE extraction. For our case, we consider that the named entities extraction can be assimilated to a typical classification problem. Indeed, this extraction consists of searching for text portions that can be classified in a NE class (Person, Locality or Organization). Thus, we choose to use a supervised learning approach and employ the BIO tagging format that can solve the twin problems of segmentation and categorization. In addition, singular classifier cannot give good results for all types of contexts. Thus, we adopt a set of weighted classifiers which we combined through a voting procedure. In order to appreciate properly the performance of our system, we perform two types of tests: with and without morphological attributes. We consider that the results are highly satisfactory especially with a accuracy that exceeds 89% for both Person and Locality classes.

机译：在本文中，我们描述了一种从阿拉伯文本中提取命名实体的方法。阿拉伯语言很难处理，因为其特征甚至会影响NE提取。对于我们的情况，我们认为命名实体提取可以与典型的分类问题相提并论。实际上，此提取包括搜索可以归类为NE类（人员，位置或组织）的文本部分。因此，我们选择使用监督学习方法并采用BIO标记格式，该格式可以解决分割和分类的双重问题。另外，奇异分类器不能为所有类型的上下文提供良好的结果。因此，我们采用了一组加权分类器，这些分类器是通过表决程序组合而成的。为了正确地了解我们系统的性能，我们执行两种类型的测试：有和没有形态属性。我们认为结果非常令人满意，特别是对于“人物”和“地方”类的准确性都超过89％。

著录项

来源
《》|2015年|55-60|共6页
会议地点 Cairo(EG)
作者
F��riel Ben Fraj Trabelsi; Chiraz Ben Othmane Zribi; Wiem Kouki;
展开▼
作者单位

RIADI Lab., La Manouba Univ., Tunisia;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Arabic language; Named Entity Recognition; classification; combination;

机译：阿拉伯语言;命名实体识别;分类;组合;

相似文献

外文文献
中文文献
专利

1. Psychological named entity recognition from psychological Arabic texts [J] . Kheira Lakel, Fatima Bendella International journal of metadata, semantics and ontologies . 2017,第2a3期

机译：心理阿拉伯文本中的心理学命名实体识别
2. A real time Named Entity Recognition system for Arabic text mining [J] . Harith Al-Jumaily, Paloma Martinez, Jose L. Martinez-Fernandez, Language Resources and Evaluation . 2012,第4期

机译：用于阿拉伯文本挖掘的实时命名实体识别系统
3. A Survey of Arabic Named Entity Recognition and Classification [J] . Khaled Shaala Computational linguistics . 2014,第2期

机译：阿拉伯命名实体识别和分类调查
4. Combined Classification for Extracting Named Entities from Arabic Texts [C] . F??riel Ben Fraj Trabelsi, Chiraz Ben Othmane Zribi, Wiem Kouki International Conference on Arabic Computational Linguistics . 2016

机译：从阿拉伯语文本中提取命名实体的组合分类
5. Arabic Named Entity Recognition: A Corpus-Based Study [D] . Algahtani, Shabib. 2012

机译：阿拉伯语命名实体识别：基于语料库的研究
6. Combining automatic table classification and relationship extraction in extracting anticancer drug-side effect pairs from full-text articles [O] . Rong Xu, QuanQiu Wang -1

机译：结合自动表格分类和关系提取从全文文章中提取抗癌药物副作用对
7. Arabic Language Processing for Text Classification. Contributions to Arabic Root Extraction Techniques, Building An Arabic Corpus, and to Arabic Text Classification Techniques. [O] . Al-Nashashibi May Yacoub Adib 2012

机译：用于文本分类的阿拉伯语言处理。对阿拉伯语根提取技术，建立阿拉伯语语料库和阿拉伯文本分类技术的贡献。

Combined Classification for Extracting Named Entities from Arabic Texts

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅