首页> 外文会议>Conference on empirical methods in natural language processing >A Simpler and More Generalizable Story Detector using Verb and Character Features
【24h】

A Simpler and More Generalizable Story Detector using Verb and Character Features

机译:使用动词和字符功能更简单,更广泛的故事探测器

获取原文

摘要

Story detection is the task of determining whether or not a unit of text contains a story. Prior approaches achieved a maximum performance of 0.66 F_1, and did not generalize well across different corpora. We present a new state-of-the-art detector that achieves a maximum performance of 0.75 F_1 (a 14% improvement), with significantly greater general-lzability than previous work. In particular, our detector achieves performance above 0.70 F_1 across a variety of combinations of lexically different corpora for training and testing, as well as dramatic improvements (up to 4,000%) in performance when trained on a small, disfluent data set. The new detector uses two basic types of features-ones related to events, and ones related to characters-totaling 283 specific features overall, previous detectors used tens of thousands of features, and so this detector represents a significant simplification along with increased performance.
机译:故事检测是确定文本单位是否包含故事的任务。现有方法实现了0.66 F_1的最大性能,并且在不同的基础上没有概括。我们提出了一种新的最先进的探测器,实现了0.75 f_1(改进14%)的最大性能,其一般易达到比以前的工作更大。特别是,我们的探测器在Lexly不同的Corpora的各种组合中实现了0.70 f_1以进行培训和测试,以及在培训时在小型无流失的数据集上培训时的戏剧性改进(高达4,000%)。新探测器使用与事件相关的两个基本类型的功能 - 与字符相关的字符总计283个特定功能,以前的探测器使用了数万个功能,因此该检测器代表了显着的简化以及性能的显着简化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号