Sentence-level event classification in unstructured texts

M. Naughton; N. Stokes; J. Carthy

Abstract

The ability to correctly classify sentences that describe events is an important task for many natural language applications such as Question Answering (QA) and Text Summarisation. In this paper, we treat event detection as a sentence level text classification problem. Overall, we compare the performance of discriminative versus generative approaches to this task: namely, a Support Vector Machine (SVM) classifier versus a Language Modeling (LM) approach. We also investigate a rule-based method that uses handcrafted lists of 'trigger' terms derived from WordNet. Two datasets are used in our experiments to test each approach on six different event types, i.e., Die, Attack, Injure, Meet, Transport and Charge-Indict. Our experimental results show that the trained SVM classifier significantly outperforms the simple rule-based system and language modeling approach on both datasets: ACE (F1 66% vs. 45% and 38%, respectively) and IBC (F1 92% vs. 88% and 74%, respectively). A detailed error analysis framework for the task is also provided which separates errors into different types: semantic, inference, continuous and trigger-less.
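The rule-based approach and the F1 measure mentioned in the abstract can be illustrated with a minimal sketch. The trigger lists, the toy sentences, and the helper names (`TRIGGERS`, `rule_based_label`, `f1_score`) are assumptions made for illustration only; the paper's actual WordNet-derived lists and its ACE and IBC data are not reproduced in this record.

```python
import re

# Hypothetical trigger lists for two of the six event types. These are
# illustrative stand-ins, not the handcrafted WordNet-derived lists.
TRIGGERS = {
    "Die": {"die", "died", "dies", "killed", "death"},
    "Attack": {"attack", "attacked", "bombing", "shot", "ambush"},
}


def tokenize(sentence):
    """Lowercase a sentence and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", sentence.lower())


def rule_based_label(sentence, event_type):
    """Return True if the sentence contains any trigger term for event_type."""
    return any(tok in TRIGGERS[event_type] for tok in tokenize(sentence))


def f1_score(gold, predicted):
    """Compute F1 for binary labels given as parallel lists of booleans."""
    tp = sum(g and p for g, p in zip(gold, predicted))
    fp = sum((not g) and p for g, p in zip(gold, predicted))
    fn = sum(g and (not p) for g, p in zip(gold, predicted))
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy sentences invented for this sketch (not drawn from ACE or IBC).
sentences = [
    "Two soldiers died in the blast.",
    "The ministers will meet on Friday.",
    # Describes a death without using any listed trigger term.
    "He passed away after the explosion.",
    # Contains a trigger term but is labeled as a non-event here.
    "The death toll figures were debated in parliament.",
]
gold = [True, False, True, False]
predicted = [rule_based_label(s, "Die") for s in sentences]

print(predicted)                   # [True, False, False, True]
print(f1_score(gold, predicted))   # 0.5
```

On this toy input the trigger lookup misses one event sentence and flags one non-event sentence, so precision and recall are both 0.5. Misses of the first kind, where no listed term appears, may be close to what the abstract calls trigger-less errors, although the record does not define the error categories.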


Bibliographic details

Source: Information Retrieval, 2010, Issue 2, pp. 132-156 (25 pages)
Authors: M. Naughton; N. Stokes; J. Carthy
Author affiliation: School of Computer Science and Informatics, University College Dublin, Dublin, Ireland (all three authors)
Original format: PDF
Language: English
Keywords: information extraction; event detection; language modeling; machine learning

Similar literature

1. Using data-driven feature enrichment of text representation and ensemble technique for sentence-level polarity classification [J]. Pu Zhang, Zhongshi He. Journal of Information Science. 2015, Issue 4.
2. A Hybrid of Sentence-Level Approach and Fragment-Level Approach of Parallel Text Extraction from Comparable Text [J]. Yin-Lai Yeong, Tien-Ping Tan, Keng Hoon Gan. Procedia Computer Science. 2019, Issue 12.
3. DEEPENING HISTORICAL GIS: AN INTEGRATED DATABASE SOLUTION FOR LINKING PEOPLE, PLACE AND EVENTS THROUGH UNSTRUCTURED TEXT [J]. Jim Schindling, Trevor M. Harris. History and Computing. 2018, Issue 2.
4. Investigating Statistical Techniques for Sentence-Level Event Classification [C]. Martina Naughton, Nicola Stokes, Joe Carthy. 22nd International Conference on Computational Linguistics. 2008.
5. Performance of classification tools on unstructured text [D]. Kourik, Janet L. 2005.
6. Supporting the use of standardized nursing terminologies with automatic subject heading prediction: a comparison of sentence-level text classification methods [O]. Hans Moen, Kai Hakala, Laura-Maria Peltonen, 2020.
7. Sentence-Level Event Classification in Unstructured Texts [O]. Martina Naughton, Nicola Stokes, Joe Carthy. 2008.
