Statistical Modality Tagging from Rule-based Annotations and Crowdsourcing

机译：基于规则的注释和众包的统计模态标记

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We explore training an automatic modality tagger. Modality is the attitude that a speaker might have toward an event or state. One of the main hurdles for training a linguistic tagger is gathering training data. This is particularly problematic for training a tagger for modality because modality triggers are sparse for the overwhelming majority of sentences. We investigate an approach to automatically training a modality tagger where we first gathered sentences based on a high-recall simple rule-based modality tagger and then provided these sentences to Mechanical Turk annotators for further annotation. We used the resulting set of training data to train a precise modality tagger using a multi-class SVM that delivers good performance.

机译：我们探索培训自动模态标记器。偶数是发言者可能对事件或国家可能具有的态度。用于培训语言标记器的主要障碍是收集培训数据。这对于训练标记对于模态训练标签尤其有问题，因为模态触发对于绝大多数句子稀疏。我们调查一种自动培训模态标记器的方法，在那里我们首次基于高回忆简单规则的模型标记器的句子，然后向机械土库注入器提供这些句子以进行进一步注释。我们使用由Metecting的培训数据集用于使用多级SVM培训精确的模型标记器，可提供良好的性能。

著录项

来源
《Workshop on Extra-Propositional Aspects of Meaning in Computational Linguistics》|2012年||共8页
会议地点
作者
Vinodkumar Prabhakaran; Michael Bloodgood; Mona Diab; Bonnie Dorr; Lori Levin; Christine D. Piatko; Owen Rambow; Benjamin Van Durme;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程;
关键词

相似文献

外文文献
中文文献
专利

1. Description of the Chinese-to-Spanish Rule-Based Machine Translation System Developed Using a Hybrid Combination of Human Annotation and Statistical Techniques [J] . MARTA R. COSTA-JUSSA, JORDI CENTELLES ACM transactions on Asian language information processing . 2016,第1期

机译：基于人类注释和统计技术的混合开发的基于中文的西班牙语规则机器翻译系统的说明
2. Towards Modal Integration of Overhead and Underground Low-Voltage and Medium-Voltage Power Line Communication Channels in the Smart Grid Landscape: Model Expansion, Broadband Signal Transmission Characteristics, and Statistical Performance Metrics (Invited Paper) [J] . Athanasios G.Lazaropoulos International Scholarly Research Notices . 2012,第5期

机译：朝着智能电网景观中的开销和地下低压和中压力线通信通信通信通道的模态集成：模型扩展，宽带信号传输特性和统计性能指标（邀请纸）
3. Multi-Stage Automatic NE and PoS Annotation Using Pattern-Based and Statistical-Based Techniques for Thai Corpus Construction [J] . Nattapong TONGTEP, Thanaruk THEERAMUNKONG IEICE transactions on information and systems . 2013,第10期

机译：使用基于模式和基于统计的技术对泰国语料库进行多阶段自动NE和PoS注释
4. Statistical Modality Tagging from Rule-based Annotations and Crowdsourcing [C] . Vinodkumar Prabhakaran, Michael Bloodgood, Mona Diab, Workshop on Extra-Propositional Aspects of Meaning in Computational Linguistics 2012 . 2012

机译：基于规则的注释和众包的统计模式标记
5. Crowdsourcing annotation for machine learning in natural language processing tasks. [D] . Zaidan, Omar F. 2012

机译：用于自然语言处理任务中机器学习的众包注释。
6. A Statistical Approach to Correcting Cross-Annotations in a Metagenomic Functional Profile Generated by Short Reads [O] . Ruofei Du, Donald Mercante, Lingling An, -1

机译：校正短读所产生的元基因组功能谱中交叉注释的统计方法
7. Automatic Annotation and Assessment of Syntactic Structures in Law Texts Combining Rule-Based and Statistical Methods [O] . Sugisaki, Kyoko 2016

机译：基于规则和统计方法相结合的法律文本句法结构自动注释和评估

Statistical Modality Tagging from Rule-based Annotations and Crowdsourcing

摘要

著录项

相似文献

相关主题

期刊订阅