Differential Evolution based Feature Selection and Classifier Ensemble for Named Entity Recognition

Abstract

In this paper, we propose a differential evolution (DE) based two-stage evolutionary approach for named entity recognition (NER). The first stage addresses the problem of relevant feature selection for NER within the frameworks of two popular machine learning algorithms, namely Conditional Random Field (CRF) and Support Vector Machine (SVM). The solutions in the final best population provide a diverse set of classifiers; some are effective with respect to recall, whereas others are effective with respect to precision. In the second stage, we propose a novel classifier ensemble technique for combining these classifiers. The approach is very general and can be applied to any classification problem. Currently, we evaluate the proposed algorithm for NER in three popular Indian languages, namely Bengali, Hindi and Telugu. To maintain domain independence, the features are selected and developed mostly without using any deep domain knowledge and/or language-dependent resources. Experimental results show that the proposed two-stage technique attains final F-measure values of 88.89%, 88.09% and 76.63% for Bengali, Hindi and Telugu, respectively. The key contributions of this work are two-fold: (i) the proposal of DE based feature selection and classifier ensemble methods that can be applied to any classification problem; and (ii) scope for the development of language-independent NER systems in a resource-poor scenario.
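
The first stage described in the abstract can be illustrated with a minimal sketch of DE-style feature selection. The abstract does not give the encoding, the DE variant, or the fitness function, so everything below is an assumption made for illustration: individuals are real vectors thresholded at 0.5 to obtain a feature mask, the loop follows the common DE/rand/1/bin scheme, and `toy_fitness` is a hypothetical stand-in for what would presumably be a validation score of a CRF or SVM trained on the selected features.

```python
import random

# Hypothetical ground truth for the toy problem: only these features help.
USEFUL = {0, 2, 5}


def toy_fitness(mask):
    """Toy stand-in for a classifier score: F-measure of the selected
    feature set against USEFUL. Not the fitness used in the paper."""
    selected = {i for i, on in enumerate(mask) if on}
    tp = len(selected & USEFUL)
    if tp == 0:
        return 0.0
    precision = tp / len(selected)
    recall = tp / len(USEFUL)
    return 2 * precision * recall / (precision + recall)


def decode(vector):
    """Assumed encoding: a feature is selected when its gene is >= 0.5."""
    return [gene >= 0.5 for gene in vector]


def differential_evolution_select(fitness, n_features, pop_size=20,
                                  generations=50, f=0.5, cr=0.9, seed=0):
    """DE/rand/1/bin loop over real vectors in [0, 1] that maximizes fitness."""
    rng = random.Random(seed)
    pop = [[rng.random() for _ in range(n_features)] for _ in range(pop_size)]
    scores = [fitness(decode(ind)) for ind in pop]
    for _ in range(generations):
        for i in range(pop_size):
            # Pick three distinct individuals, all different from the target.
            a, b, c = rng.sample([j for j in range(pop_size) if j != i], 3)
            j_rand = rng.randrange(n_features)  # force at least one mutated gene
            trial = []
            for j in range(n_features):
                if rng.random() < cr or j == j_rand:
                    value = pop[a][j] + f * (pop[b][j] - pop[c][j])
                    trial.append(min(1.0, max(0.0, value)))  # clamp to [0, 1]
                else:
                    trial.append(pop[i][j])
            trial_score = fitness(decode(trial))
            # Greedy replacement: keep the trial if it is at least as good.
            if trial_score >= scores[i]:
                pop[i], scores[i] = trial, trial_score
    best = max(range(pop_size), key=lambda i: scores[i])
    return decode(pop[best]), scores[best]


mask, score = differential_evolution_select(toy_fitness, n_features=8)
print("selected features:", [i for i, on in enumerate(mask) if on])
print("fitness:", score)
```

In this sketch only the best individual is returned. The abstract instead keeps the whole final population, since its members trade off precision and recall differently, and passes those classifiers to the second-stage ensemble.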

Bibliographic details

Source
International Conference on Computational Linguistics | 2012 | pp. 2475-2490 | 16 pages
Authors
Utpal Kumar Sikdar; Asif Ekbal; Sriparna Saha
Original format
PDF
Keywords
Named Entity Recognition; Differential Evolution; Feature Selection; Classifier Ensemble

Similar literature

1. Combining feature selection and classifier ensemble using a multiobjective simulated annealing approach: Application to named entity recognition [J]. Ekbal A., Saha S. Soft Computing: A Fusion of Foundations, Methodologies and Applications. 2013, No. 1
2. Multiobjective optimization for classifier ensemble and feature selection: an application to named entity recognition [J]. Asif Ekbal, Sriparna Saha. International Journal on Document Analysis and Recognition. 2012, No. 2
3. Combining multiple classifiers using vote based classifier ensemble technique for named entity recognition [J]. Sriparna Saha, Asif Ekbal. Data & Knowledge Engineering. 2013, May issue
4. Differential Evolution based Feature Selection and Classifier Ensemble for Named Entity Recognition [C]. Utpal Kumar Sikdar, Asif Ekbal, Sriparna Saha. International Conference on Computational Linguistics. 2012
5. Advancing Biomedical Named Entity Recognition with Multivariate Feature Selection and Semantically Motivated Features [D]. Leaman, James Robert, Jr. 2013
6. ECFS-DEA: an ensemble classifier-based feature selection for differential expression analysis on expression profiles [O]. Xudong Zhao, Qing Jiao, Hangyu Li, 2020
7. Dutch named entity recognition using classifier ensembles [O]. Desmet Bart, Hoste Veronique. 2011