Identification of discriminative features for biological event extraction throughlinguistically informed feature selection

Xing Zhang; Jingbo Xi; Jonathan Webster; Alex Chengyu Fang

首页> 外文期刊>Journal of food, agriculture & environment >Identification of discriminative features for biological event extraction throughlinguistically informed feature selection

【24h】

Identification of discriminative features for biological event extraction throughlinguistically informed feature selection

机译：通过识别生物事件提取的辨别特征，通过创新的特征选择

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Machine learning classifiers have achieved significant performance in the area of biomedical event extraction. For example, support vector machine SVM) classifiers in the Turku Event Extraction System achieved the best performance in BioNLP09 task. Such classifiers typically rely on the use )f large feature sets. Despite their robust performance, however, recent research has suggested that feature sets produced through automatic training need to be further optimized through size reduction in order toimprove system performance. The current paper attempts to identify ways to reduce the size of feature sets by investigating the contribution of four different feature sets constructed according to lexical, grammatical, syntactic and semantic information. It reports an experiment based on BioNLP data prepared by the Turku team for biological event extraction and examines to what extent the dimension of the feature sets can be reduced while the classifier can still achieve similar performance. The importance of each feature set is evaluated through a SVM classifier. Our experiments demonstrate that feature set construction according to lexical, grammatical and syntactic J information can effectively reduce the set size by as much as 86% while maintaining a comparable performance, hence significantly resolving the feature dimension issue. It is also shown through our experiments that a hybrid feature set constructed according to a combination of lexical and semantic information can achieve the second highest accuracy, hence indicating the useful feasibility of constructing an optimal feature set through dimension reduction and feature combination. We conclude that the experiments reported in the current paper have produced empirical evidence supportingthe importance of linguistic information for the construction of high-performance feature sets in addition to domain knowledge for the task of biomedical event extraction.

机译：机器学习分类器在生物医学事件提取领域取得了显着性能。例如，Turku事件提取系统中的支持向量机SVM）分类器在BiONLP09任务中实现了最佳性能。这种分类器通常依赖于使用）F大功能集。然而，尽管他们的性能强劲，但最近的研究表明，通过自动培训生产的功能集需要通过尺寸减少来进一步优化，以便进行系统性能。目前的纸张试图通过调查根据词汇，语法，句法和语义信息构建的四种不同特征集的贡献来识别减少特征集大小的方法。它报告了基于由土库普团队为生物事件提取制备的BionlP数据的实验，并在分类器仍然可以实现类似的性能的同时可以减少特征集的维度的程度。通过SVM分类器评估每个功能集的重要性。我们的实验表明，根据词法，语法和句法J信息的特征设定结构可以在保持相当的性能的同时有效地将设定大小减少到86％，因此显着解析了特征维度问题。还通过我们的实验示出了根据词汇和语义信息的组合构造的混合特征组可以实现第二最高精度，因此指示通过尺寸减小和特征组合构造最佳特征的有用可行性。我们得出结论，本文报告的实验已经产生了支持在域名知识外，支持对建设高性能特征的语言信息的重要性，以及生物医学事件提取任务的域名知识。

著录项

来源
《Journal of food, agriculture & environment》 |2013年第2期|共5页
作者
Xing Zhang; Jingbo Xi; Jonathan Webster; Alex Chengyu Fang;
展开▼
作者单位

The Halliday Centre for Intelligent Applications of Language Studies City University of Hong Kong Tat Chee Avenue Kowloon Hong Kong SAR;

Department of Chinese Translation and Linguistics City University of Hong Kong Tat Chee Avenue Kowloon Hong Kong SAR;

Department of Chinese Translation and Linguistics City University of Hong Kong Tat Chee Avenue Kowloon Hong Kong SAR;

Department of Chinese Translation and Linguistics City University of Hong Kong Tat Chee Avenue Kowloon Hong Kong SAR;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类食品工业;
关键词
Turku event extraction system; feature selection; event extraction; support vector machine; linguistic features; syntactic information; semantic information.;

机译：图库事件提取系统;特征选择;事件提取;支持向量机;语言特征;语法信息;语义信息。;

相似文献

外文文献
中文文献
专利

1. Identification of discriminative features for biological event extraction throughlinguistically informed feature selection [J] . Xing Zhang, Jingbo Xi, Jonathan Webster, Journal of food, agriculture & environment . 2013,第1aPta2期

机译：通过识别生物事件提取的辨别特征，通过创新的特征选择
2. Feature extraction and sensor selection for NPP initiating event identification [J] . Lin Ting-Han, Wu Shun-Chi, Chen Kuang-You, Annals of nuclear energy . 2017,第MAY期

机译：用于NPP启动事件识别的特征提取和传感器选择
3. DISCRIMINATIVE FEATURE EXTRACTION BASED ON SELF-ADAPTIVE FREQUENCY WARPING FOR ROBUST SPEAKER IDENTIFICATION [J] . YANPING LI, ZHENMIN TANG, HUI DING, International Journal of Information Acquisition . 2008,第4期

机译：基于自自适应频率包裹的鲁棒说话人鉴别特征提取
4. Discriminative region extraction and feature selection based on the combination of SURF and saliency [C] . Li Deng, Chunhong Wang, Changhui Rao ISPDI 2011;International symposium on photoelectronic detection and imaging . 2012

机译：基于SURF和显着性的判别区域提取和特征选择
5. ANALYSIS OF THE PERFORMANCE OF A PARAMETRIC AND NONPARAMETRIC CLASSIFICATION SYSTEM: AN APPLICATION TO FEATURE SELECTION AND EXTRACTION IN RADAR TARGET IDENTIFICATION. [D] . DJOUADI, ABDELHAMID. 1987

机译：参数和非参数分类系统的性能分析：在雷达目标识别中的特征选择和提取中的应用。
6. A robust tool for discriminative analysis and feature selection in paired samples impacts the identification of the genes essential for reprogramming lung tissue to adenocarcinoma [O] . Swee Heng Toh, Philip Prathipati, Efthimios Motakis, 2011

机译：用于配对样品的判别分析和特征选择的强大工具会影响对肺组织重编程为腺癌必不可少的基因的鉴定
7. A robust tool for discriminative analysis and feature selection in paired samples impacts the identification of the genes essential for reprogramming lung tissue to adenocarcinoma [O] . Swee Toh, Philip Prathipati, Efthimios Motakis, 2011

机译：用于配对样品的判别分析和特征选择的强大工具会影响对肺组织重编程为腺癌必不可少的基因的鉴定
8. Improved Feature Extraction, Feature Selection, and Identification Techniques That Create a Fast Unsupervised Hyperspectral Target Detection Algorithm [R] . Johnson, R. J. 2008

机译：改进的特征提取，特征选择和识别技术，创建快速无监督的高光谱目标检测算法

Identification of discriminative features for biological event extraction throughlinguistically informed feature selection

摘要

著录项

相似文献

相关主题

期刊订阅