...
首页> 外文期刊>Information retrieval >Sentence-level event classification in unstructured texts
【24h】

Sentence-level event classification in unstructured texts

机译:非结构化文本中的句子级事件分类

获取原文
获取原文并翻译 | 示例

摘要

The ability to correctly classify sentences that describe events is an important task for many natural language applications such as Question Answering (QA) and Text Summarisation. In this paper, we treat event detection as a sentence level text classification problem. Overall, we compare the performance of discriminative versus generative approaches to this task: namely, a Support Vector Machine (SVM) classifier versus a Language Modeling (LM) approach. We also investigate a rule-based method that uses handcrafted lists of 'trigger' terms derived from WordNet. Two datasets are used in our experiments to test each approach on six different event types, i.e., Die, Attack, Injure, Meet, Transport and Charge-Indict. Our experimental results show that the trained SVM classifier significantly outperforms the simple rule-based system and language modeling approach on both datasets: ACE (F1 66% vs. 45% and 38%, respectively) and IBC (F1 92% vs. 88% and 74%, respectively). A detailed error analysis framework for the task is also provided which separates errors into different types: semantic, inference, continuous and trigger-less.
机译:对描述事件的句子进行正确分类的能力是许多自然语言应用程序(例如问题解答(QA)和文本摘要)的一项重要任务。在本文中,我们将事件检测视为句子级文本分类问题。总体而言,我们比较了区分性方法和生成性方法在此任务上的性能:即支持向量机(SVM)分类器与语言建模(LM)方法。我们还研究了基于规则的方法,该方法使用了从WordNet派生的“触发”术语的手工列表。我们的实验中使用了两个数据集,以测试六种不同事件类型(即死亡,攻击,伤害,相遇,运输和指控)的每种方法。我们的实验结果表明,经过训练的SVM分类器在两个数据集上均优于简单的基于规则的系统和语言建模方法:ACE(F1分别为66%,45%和38%)和IBC(F1分别为92%和88%)。和74%)。还提供了针对该任务的详细错误分析框架,该框架将错误分为不同的类型:语义,推断,连续和无触发。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号