首页> 外文期刊>Computational linguistics >Improving Accuracy in Word Class Tagging through the Combination of Machine Learning Systems
【24h】

Improving Accuracy in Word Class Tagging through the Combination of Machine Learning Systems

机译:结合机器学习系统提高词类标注的准确性

获取原文
获取原文并翻译 | 示例
           

摘要

We examine how differences in language models, learned by different data-driven systems performing the same NLP task, can be exploited to yield a higher accuracy than the best individual system. We do this by means of experiments involving the task of morphosyntactic word class tagging, on the basis of three different tagged corpora. Four well-known tagger generators (hidden Markov model, memory-based, transformation rules, and maximum entropy) are trained on the same corpus data. After comparison, their outputs are combined using several voting strategies and second-stage classifiers. All combination taggers outperform their best component. The reduction in error rate varies with the material in question, but can be as high as 24.3% with the LOB corpus.
机译:我们研究了如何利用执行同一NLP任务的不同数据驱动系统学习到的语言模型中的差异,来产生比最佳单个系统更高的准确性。我们通过在三个不同的已标记语料库的基础上,通过涉及形态句法词类标记任务的实验来做到这一点。在相同的语料库数据上训练了四个著名的标记生成器(隐马尔可夫模型,基于内存的转换规则和最大熵)。比较之后,它们的输出使用几种投票策略和第二阶段分类器进行合并。所有组合标记器均胜过其最佳组件。错误率的降低因所涉及的材料而异,但LOB语料库的错误率可能高达24.3%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号