首页> 外文会议>Workshop on Extra-Propositional Aspects of Meaning in Computational Linguistics >Statistical Modality Tagging from Rule-based Annotations and Crowdsourcing
【24h】

Statistical Modality Tagging from Rule-based Annotations and Crowdsourcing

机译:基于规则的注释和众包的统计模态标记

获取原文

摘要

We explore training an automatic modality tagger. Modality is the attitude that a speaker might have toward an event or state. One of the main hurdles for training a linguistic tagger is gathering training data. This is particularly problematic for training a tagger for modality because modality triggers are sparse for the overwhelming majority of sentences. We investigate an approach to automatically training a modality tagger where we first gathered sentences based on a high-recall simple rule-based modality tagger and then provided these sentences to Mechanical Turk annotators for further annotation. We used the resulting set of training data to train a precise modality tagger using a multi-class SVM that delivers good performance.
机译:我们探索培训自动模态标记器。偶数是发言者可能对事件或国家可能具有的态度。用于培训语言标记器的主要障碍是收集培训数据。这对于训练标记对于模态训练标签尤其有问题,因为模态触发对于绝大多数句子稀疏。我们调查一种自动培训模态标记器的方法,在那里我们首次基于高回忆简单规则的模型标记器的句子,然后向机械土库注入器提供这些句子以进行进一步注释。我们使用由Metecting的培训数据集用于使用多级SVM培训精确的模型标记器,可提供良好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号