Improving Accuracy in Word Class Tagging through the Combination of Machine Learning Systems

Hans van Halteren; Jakub Zavrel; Walter Daelemans


Abstract

We examine how differences in language models, learned by different data-driven systems performing the same NLP task, can be exploited to yield a higher accuracy than the best individual system. We do this by means of experiments involving the task of morphosyntactic word class tagging, on the basis of three different tagged corpora. Four well-known tagger generators (hidden Markov model, memory-based, transformation rules, and maximum entropy) are trained on the same corpus data. After comparison, their outputs are combined using several voting strategies and second-stage classifiers. All combination taggers outperform their best component. The reduction in error rate varies with the material in question, but can be as high as 24.3% with the LOB corpus.
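The abstract does not spell out the voting strategies, so the following is only a minimal sketch of the simplest one that fits the description: each component tagger casts one vote per token and the most frequent tag wins. The function names (`plurality_vote`, `combine`, `error_reduction`), the tie-breaking rule (prefer the earliest tagger in the list), and the toy tag sequences are all assumptions made for illustration; they are not taken from the paper. The `error_reduction` helper shows how a relative reduction in error rate, such as the one quoted for the LOB corpus, is computed from two accuracy figures.

```python
from collections import Counter
from typing import List, Sequence


def plurality_vote(tags: Sequence[str]) -> str:
    """Return the tag proposed by the most component taggers for one token.

    Ties are broken in favour of the earliest tagger in `tags`
    (an assumption of this sketch, not a rule from the paper).
    """
    if not tags:
        raise ValueError("at least one tag is required")
    counts = Counter(tags)
    best = max(counts.values())
    for tag in tags:  # scan in tagger order so ties resolve deterministically
        if counts[tag] == best:
            return tag
    raise AssertionError("unreachable: some tag always has the top count")


def combine(outputs: Sequence[Sequence[str]]) -> List[str]:
    """Combine aligned tag sequences, one sequence per tagger."""
    return [plurality_vote(token_tags) for token_tags in zip(*outputs)]


def error_reduction(best_single_acc: float, combined_acc: float) -> float:
    """Relative error-rate reduction of the combination over the best component."""
    base_err = 1.0 - best_single_acc
    new_err = 1.0 - combined_acc
    return (base_err - new_err) / base_err


# Hypothetical outputs of four taggers on a three-token sentence.
hmm_tags = ["DT", "NN", "VB"]
memory_based_tags = ["DT", "NN", "NN"]
transformation_tags = ["DT", "VB", "NN"]
maxent_tags = ["DT", "NN", "NN"]

combined = combine([hmm_tags, memory_based_tags, transformation_tags, maxent_tags])
print(combined)
```

In this toy input every tagger is wrong on at most one token, and no two taggers are wrong on the same token, so the vote recovers a sequence that none of the erring taggers produced alone. This is the kind of complementary error pattern that combination relies on. The second-stage classifiers mentioned in the abstract would replace `plurality_vote` with a learned model over the component outputs.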


Bibliographic details

Source: Computational Linguistics, 2001, No. 2, 31 pages
Authors: Hans van Halteren; Jakub Zavrel; Walter Daelemans
Original format: PDF
Language: English
Chinese Library Classification: Computing technology, computer technology

