Identification of discriminative features for biological event extraction through linguistically informed feature selection

Zhang Xing; Xia Jingbo; Webster Jonathan; Fang Alex Chengyu

首页> 外文期刊>Journal of food, agriculture & environment >Identification of discriminative features for biological event extraction through linguistically informed feature selection

【24h】

Identification of discriminative features for biological event extraction through linguistically informed feature selection

机译：通过语言告知的特征选择识别用于生物事件提取的区分特征

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Machine learning classifiers have achieved significant performance in the area of biomedical event extraction. For example, support vector machine (SVM) classifiers in the Turku Event Extraction System achieved the best performance in BioNLP09 task. Such classifiers typically rely on the use of large feature sets. Despite their robust performance, however, recent research has suggested that feature sets produced through automatic training need to be further optimized through size reduction in order to improve system performance. The current paper attempts to identify ways to reduce the size of feature sets by investigating the contribution of four different feature sets constructed according to lexical, grammatical, syntactic and semantic information. It reports an experiment based on BioNLP data prepared by the Turku team for biological event extraction and examines to what extent the dimension of the feature sets can be reduced while the classifier can still achieve similar performance. The importance of each feature set is evaluated through a SVM classifier. Our experiments demonstrate that feature set construction according to lexical, grammatical and syntactic information can effectively reduce the set size by as much as 86% while maintaining a comparable performance, hence significantly resolving the feature dimension issue. It is also shown through our experiments that a hybrid feature set constructed according to a combination of lexical and semantic information can achieve the second highest accuracy, hence indicating the useful feasibility of constructing an optimal feature set through dimension reduction and feature combination. We conclude that the experiments reported in the current paper have produced empirical evidence supporting the importance of linguistic information for the construction of high-performance feature sets in addition to domain knowledge for the task of biomedical event extraction.

机译：机器学习分类器在生物医学事件提取领域取得了显着的性能。例如，图尔库事件提取系统中的支持向量机（SVM）分类器在BioNLP09任务中获得了最佳性能。这样的分类器通常依赖于大型功能集的使用。尽管它们具有强大的性能，但是最近的研究表明，通过自动训练生成的功能集需要通过减小尺寸来进一步优化，以提高系统性能。本文试图通过调查根据词汇，语法，句法和语义信息构造的四个不同特征集的贡献，来确定减小特征集大小的方法。它报告了基于Turku团队准备的BioNLP数据进行生物事件提取的实验，并检查了特征集的维数可以减少到什么程度，而分类器仍然可以达到类似的性能。每个功能集的重要性通过SVM分类器进行评估。我们的实验表明，根据词汇，语法和句法信息构造的特征集可以有效地将集合大小减少多达86％，同时保持可比的性能，从而显着解决了特征维问题。通过我们的实验还表明，根据词汇和语义信息的组合构建的混合特征集可以达到第二高的准确性，因此表明了通过降维和特征组合来构建最佳特征集的有用可行性。我们得出的结论是，本论文中报道的实验已经产生了经验证据，证明语言信息对于构建高性能功能集以及生物医学事件提取任务的领域知识的重要性。

著录项

来源
《Journal of food, agriculture & environment》 |2013年第1期|共5页
作者
Zhang Xing; Xia Jingbo; Webster Jonathan; Fang Alex Chengyu;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类食品工业;
关键词
Turku event extraction system; feature selection; event extraction; support vector machine; linguist;

机译：Turku事件提取系统;特征选择;事件提取;支持向量机;语言学家;

相似文献

外文文献
中文文献
专利

1. Identification of discriminative features for biological event extraction through linguistically informed feature selection [J] . Zhang Xing, Xia Jingbo, Webster Jonathan, Journal of food, agriculture & environment . 2013,第1aPta1期

机译：通过语言告知的特征选择识别用于生物事件提取的区分特征
2. Feature extraction and sensor selection for NPP initiating event identification [J] . Lin Ting-Han, Wu Shun-Chi, Chen Kuang-You, Annals of nuclear energy . 2017,第MAY期

机译：用于NPP启动事件识别的特征提取和传感器选择
3. DISCRIMINATIVE FEATURE EXTRACTION BASED ON SELF-ADAPTIVE FREQUENCY WARPING FOR ROBUST SPEAKER IDENTIFICATION [J] . YANPING LI, ZHENMIN TANG, HUI DING, International Journal of Information Acquisition . 2008,第4期

机译：基于自自适应频率包裹的鲁棒说话人鉴别特征提取
4. Advancing Linguistic Features and Insights by Label-informed Feature Grouping: An Exploration in the Context of Native Language Identification [C] . Serhiy Bykh, Detmar Meurers International conference on computational linguistics . 2016

机译：通过标签通知的特征分组提高语言特征和见解：母语识别上下文中的探索
5. ANALYSIS OF THE PERFORMANCE OF A PARAMETRIC AND NONPARAMETRIC CLASSIFICATION SYSTEM: AN APPLICATION TO FEATURE SELECTION AND EXTRACTION IN RADAR TARGET IDENTIFICATION. [D] . DJOUADI, ABDELHAMID. 1987

机译：参数和非参数分类系统的性能分析：在雷达目标识别中的特征选择和提取中的应用。
6. A robust tool for discriminative analysis and feature selection in paired samples impacts the identification of the genes essential for reprogramming lung tissue to adenocarcinoma [O] . Swee Heng Toh, Philip Prathipati, Efthimios Motakis, 2011

机译：用于配对样品的判别分析和特征选择的强大工具会影响对肺组织重编程为腺癌必不可少的基因的鉴定
7. Towards Effective Entity Extraction of Scientific Documents using Discriminative Linguistic Features [O] . 2019

机译：利用鉴别语言特征对科学文件的有效实体提取
8. Improved Feature Extraction, Feature Selection, and Identification Techniques That Create a Fast Unsupervised Hyperspectral Target Detection Algorithm [R] . Johnson, R. J. 2008

机译：改进的特征提取，特征选择和识别技术，创建快速无监督的高光谱目标检测算法

Identification of discriminative features for biological event extraction through linguistically informed feature selection

摘要

著录项

相似文献

相关主题

期刊订阅