Simultaneous feature and parameter selection using multiobjective optimization: application to named entity recognition

Ekbal Asif; Saha Sriparna


Abstract

In this paper, we propose an efficient algorithm based on multiobjective optimization (MOO) for feature selection and parameter optimization of any machine learning technique. Feature and parameter combinations have a significant effect on the accuracy of a classifier. We perform feature selection and parameter optimization for four different classifiers, namely conditional random field, support vector machine, memory-based learner and maximum entropy. The proposed algorithms are evaluated on the problem of named entity recognition, an important component in many text processing applications. We currently experiment with four languages, namely Bengali, Hindi, Telugu and English. First, the proposed MOO-based technique is used to determine the appropriate features and parameters. For each classifier, the algorithm produces a set of solutions on the final Pareto optimal front. Each solution represents a classifier with a particular feature and parameter combination. All these solutions are then combined using a MOO-based classifier ensemble technique. Evaluation results show that the proposed approach attains F-measure (harmonic mean of recall and precision) values of 90.48%, 90.44%, 78.71% and 88.68% for Bengali, Hindi, Telugu and English, respectively. We also show that, for all experimental settings, the proposed feature and parameter optimization technique performs reasonably better than the baseline systems developed with random feature subsets. Comparisons with existing work also show the efficacy of the proposed algorithm.
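The idea of a Pareto optimal front over candidate classifiers can be illustrated with a minimal sketch. It assumes, purely for illustration, that each candidate (one feature and parameter combination) is scored on two objectives, recall and precision, both to be maximized; the abstract does not state the objectives, and the candidate labels and scores below are hypothetical, not results from the paper.

```python
from typing import List, Tuple

# (label, recall, precision); labels and scores are hypothetical examples.
Candidate = Tuple[str, float, float]


def f_measure(recall: float, precision: float) -> float:
    """Harmonic mean of recall and precision (0.0 when both are 0)."""
    if recall + precision == 0:
        return 0.0
    return 2 * recall * precision / (recall + precision)


def dominates(a: Candidate, b: Candidate) -> bool:
    """True if a is no worse than b on both objectives and better on one."""
    return a[1] >= b[1] and a[2] >= b[2] and (a[1] > b[1] or a[2] > b[2])


def pareto_front(cands: List[Candidate]) -> List[Candidate]:
    """Keep only candidates that no other candidate dominates."""
    return [c for c in cands if not any(dominates(o, c) for o in cands)]


# Assumed scores for four feature/parameter combinations.
candidates: List[Candidate] = [
    ("A", 0.80, 0.90),
    ("B", 0.85, 0.85),
    ("C", 0.75, 0.88),  # dominated by A on both objectives
    ("D", 0.90, 0.70),
]

front = pareto_front(candidates)
for label, r, p in front:
    print(label, round(f_measure(r, p), 4))
```

Every candidate left in `front` is a trade-off that no other candidate beats on both objectives at once, which is why a set of solutions rather than a single one comes out of such a search and can then be passed to an ensemble step.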

Bibliographic record

Source: International Journal of Machine Learning and Cybernetics, 2016, Issue 4, pp. 597-611 (15 pages)
Authors: Ekbal Asif; Saha Sriparna
Affiliation (both authors): Indian Institute of Technology, Department of Computer Science and Engineering, Patna, Bihar, India
Original format: PDF
Language: English
Keywords: Named entity recognition (NER); Feature selection; Parameter selection; Machine learning; Multiobjective optimization

Similar literature

1. Multiobjective optimization for classifier ensemble and feature selection: an application to named entity recognition [J]. Asif Ekbal, Sriparna Saha. International Journal on Document Analysis and Recognition. 2012, Issue 2.
2. Combining feature selection and classifier ensemble using a multiobjective simulated annealing approach: Application to named entity recognition [J]. Ekbal A., Saha S. Soft Computing: A Fusion of Foundations, Methodologies and Applications. 2013, Issue 1.
3. New approach for Arabic named entity recognition on social media based on feature selection using genetic algorithm [J]. Brahim Ait Benali, Soukaina Mihi, Ismail El Bazi. International Journal of Electrical and Computer Engineering. 2021, Issue 2.
4. Feature Selection Using Multiobjective Optimization for Named Entity Recognition [C]. Ekbal Asif, Saha Sriparna, Garbe Christoph S. 2010 20th International Conference on Pattern Recognition. 2010.
5. Advancing Biomedical Named Entity Recognition with Multivariate Feature Selection and Semantically Motivated Features [D]. Leaman, James Robert, Jr. 2013.
6. An improved chaotic fruit fly optimization based on a mutation strategy for simultaneous feature selection and parameter optimization for SVM and its applications [O]. Fei Ye, Xin Yuan Lou, Lin Fu Sun.
7. Finding Appropriate Subset of Votes Per Classifier Using Multiobjective Optimization: Application to Named Entity Recognition [O]. Ekbal Asif, Saha Sriparna, Hasanuzzaman Md. 2011.
