首页> 外文会议>Conference on empirical methods in natural language processing >A Simpler and More Generalizable Story Detector using Verb and Character Features
【24h】

A Simpler and More Generalizable Story Detector using Verb and Character Features

机译:使用动词和角色功能的更简单,更通用的故事检测器

获取原文

摘要

Story detection is the task of determining whether or not a unit of text contains a story. Prior approaches achieved a maximum performance of 0.66 F_1, and did not generalize well across different corpora. We present a new state-of-the-art detector that achieves a maximum performance of 0.75 F_1 (a 14% improvement), with significantly greater general-lzability than previous work. In particular, our detector achieves performance above 0.70 F_1 across a variety of combinations of lexically different corpora for training and testing, as well as dramatic improvements (up to 4,000%) in performance when trained on a small, disfluent data set. The new detector uses two basic types of features-ones related to events, and ones related to characters-totaling 283 specific features overall, previous detectors used tens of thousands of features, and so this detector represents a significant simplification along with increased performance.
机译:故事检测是确定文本单元是否包含故事的任务。先前的方法获得的最大性能为0.66 F_1,并且在不同语料库中的推广效果不佳。我们提出了一种最新的探测器,该探测器可实现0.75 F_1的最大性能(提高了14%),其通用性比以前的工作大得多。特别是,我们的检测器在各种词法不同的语料库进行训练和测试的各种组合中均达到0.70 F_1以上的性能,并且在使用较小的分散数据集进行训练时,其性能也有了显着提高(最高4,000%)。新的检测器使用两种基本类型的特征-与事件相关的特征与与字符相关的特征-总共总共283个特定特征,以前的检测器使用了成千上万的特征,因此该检测器代表了极大的简化以及更高的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号