Combining feature selection and classifier ensemble using a multiobjective simulated annealing approach: Application to named entity recognition

Ekbal A.; Saha S.

Abstract

In this paper, we propose a two-stage technique for named entity recognition (NER) based on multiobjective simulated annealing (MOSA). In the first stage, MOSA is used for feature selection under two statistical classifiers, namely conditional random field (CRF) and support vector machine (SVM). Each solution on the final Pareto optimal front provides a different classifier. In the second stage, these classifiers are combined using a new MOSA-based classifier ensemble technique. Several different versions of the objective functions are exploited. We hypothesize that the reliability of each classifier's predictions differs among the output classes. Thus, in an ensemble system, it is necessary to find the appropriate vote weight for each output class in each classifier. We propose a MOSA-based technique to determine these vote weights automatically. The proposed two-stage technique is evaluated for NER in Bengali, a resource-poor language, as well as in English. The highest recall, precision and F-measure values obtained are 93.95%, 95.15% and 94.55%, respectively, for Bengali and 89.01%, 89.35% and 89.18%, respectively, for English. Experiments also suggest that the classifier ensemble identified by the proposed multiobjective optimization (MOO)-based approach that optimizes the F-measure values of named entity (NE) boundary detection outperforms all the individual classifiers and four conventional baseline models.
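The per-class weighted voting described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names `weighted_vote` and `f_measure`, the class labels, and the weight values are all hypothetical, and the weights are fixed by hand here rather than searched for by MOSA. The second helper assumes the usual balanced F-measure (harmonic mean of recall and precision), which is consistent with the figures reported above.

```python
from collections import defaultdict


def weighted_vote(predictions, weights):
    """Combine the labels predicted for one token by several classifiers.

    predictions: list of labels, one per classifier.
    weights: list of dicts; weights[i][label] is the vote weight of
             classifier i for that label (hypothetical values here; the
             paper determines them with MOSA).
    """
    scores = defaultdict(float)
    for clf_index, label in enumerate(predictions):
        scores[label] += weights[clf_index].get(label, 0.0)
    # Highest total wins; ties go to the alphabetically first label.
    return max(sorted(scores), key=lambda label: scores[label])


def f_measure(recall, precision):
    """Balanced F-measure: harmonic mean of recall and precision."""
    return 2 * recall * precision / (recall + precision)


# Hypothetical per-class weights for three classifiers.
example_weights = [
    {"PER": 0.9, "LOC": 0.4, "O": 0.5},
    {"PER": 0.3, "LOC": 0.8, "O": 0.5},
    {"PER": 0.2, "LOC": 0.7, "O": 0.6},
]

# Two moderately reliable LOC votes (0.8 + 0.7) outweigh one PER vote (0.9).
print(weighted_vote(["PER", "LOC", "LOC"], example_weights))

# With three different labels, the single most trusted vote decides.
print(weighted_vote(["PER", "LOC", "O"], example_weights))

# Recomputing the F-measure from the reported recall and precision.
print(round(f_measure(93.95, 95.15), 2))  # Bengali
print(round(f_measure(89.01, 89.35), 2))  # English
```

Because each classifier carries a separate weight per output class, a classifier that is reliable for one class can be trusted there without dominating the vote on classes where it is weak.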


Bibliographic details

Source: Soft computing: A fusion of foundations, methodologies and applications, 2013, Issue 1, 16 pages
Authors: Ekbal A.; Saha S.
Original format: PDF
Language: English
CLC classification: Computer software
Keywords: Classifier ensemble; Conditional random field (CRF); Maximum entropy (ME); Multiobjective optimization (MOO); Named entity recognition; Natural language processing; Simulated annealing (SA); Support vector machine (SVM); Weighted voting


Similar documents

1. Combining feature selection and classifier ensemble using a multiobjective simulated annealing approach: Application to named entity recognition [J]. Ekbal A., Saha S. Soft computing: A fusion of foundations, methodologies and applications. 2013, Issue 1
2. Multiobjective optimization for classifier ensemble and feature selection: an application to named entity recognition [J]. Asif Ekbal, Sriparna Saha. International Journal on Document Analysis and Recognition. 2012, Issue 2
3. Simultaneous feature and parameter selection using multiobjective optimization: application to named entity recognition [J]. Ekbal Asif, Saha Sriparna. International journal of machine learning and cybernetics. 2016, Issue 4
4. Differential Evolution based Feature Selection and Classifier Ensemble for Named Entity Recognition [C]. Utpal Kumar Sikdar, Asif Ekbal, Sriparna Saha. International conference on computational linguistics. 2012
5. Advancing Biomedical Named Entity Recognition with Multivariate Feature Selection and Semantically Motivated Features [D]. Leaman, James Robert, Jr. 2013
6. Phenotype Recognition with Combined Features and Random Subspace Classifier Ensemble [O]. Bailing Zhang, Tuan D Pham. 2011
7. Finding Appropriate Subset of Votes Per Classifier Using Multiobjective Optimization: Application to Named Entity Recognition [O]. Ekbal Asif, Saha Sriparna, Hasanuzzaman Md. 2011
