Part-of-Speech Tagger Based on Maximum Entropy Model

机译：基于最大熵模型的词性标注

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The maximum entropy (ME) conditional models don’t force to adhere to the independence assumption such as in Hidden Markov generative models, and thus the ME-based Part-of-Speech (POS) tagger can depend on arbitrary, nonindependent features, which are benefit to the POS tagging, without accounting for the distribution of those dependencies. Since ME models are able to flexibly utilize a wide variety of features, the sparse problem of training data is efficiently solved. Experiments show that the POS tagging error rate is reduced by 54.25% in close test and 40.56% in open test over the Hidden-Markov-Model-based baseline, and synchronously an accuracy of 98.01% in close test and 95.56% in open test are obtained.

机译：最大熵（ME）条件模型不会强迫遵守独立性假设，例如在隐马尔可夫生成模型中，因此基于ME的词性（POS）标记器可以依赖于任意的，非独立的特征，这些特征对POS标记有好处，而无需考虑这些依赖项的分布。由于ME模型能够灵活地利用多种功能，因此有效地解决了训练数据的稀疏问题。实验表明，与基于隐马尔可夫模型的基准相比，封闭测试中的POS标记错误率降低了54.25％，开放测试中的POS标记错误率降低了40.56％，同步地，封闭测试中的准确性为98.01％，开放测试中的准确性为95.56％。获得。

著录项

来源
《IEEE international conference on computer science and information technology;ICCSIT 2009》|2009年|p.1342-1345|共4页
会议地点
作者
HUANG Heyan; ZHANG Xiaofei;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
Natural Language Processing (NLP); POS tagging; ME model; Hidden Markov model (HMM);

机译：自然语言处理（NLP）; POS标记; ME模型;隐马尔可夫模型（HMM）;

相似文献

外文文献
中文文献
专利

1. Automatic Part-of-speech Tagging for Oromo Language Using Maximum Entropy Markov Model (MEMM) [J] . Abraham Tesso Nedjo, Degen Huang, Xiaoxia Liu Journal of information and computational science . 2014,第10期

机译：使用最大熵马尔可夫模型（MEMM）的Oromo语言的自动词性标记
2. Chinese Text Similarity Algorithm Based on Part-of-Speech Tagging and Word Vector Model [J] . Zhixin Ma, Mengguang Li Journal of Computers . 2019,第4期

机译：基于词性标注和词向量模型的中文文本相似度算法
3. Part-of-speech tagger for Ainu language based on higher order Hidden Markov Model [J] . Michal Ptaszynski, Yoshio Momouchi Expert Systems with Application . 2012,第14期

机译：基于高阶隐马尔可夫模型的阿伊努语词性标注器
4. Part-of-speech tagger based on maximum entropy model [C] . IEEE International Conference on Computer Science and Information Technology . 2009

机译：基于最大熵模型的语音标记
5. Statistical machine translation: Maximum entropy based translation models and search algorithms. [D] . Garcia Varea, Ismael. 2003

机译：统计机器翻译：基于最大熵的翻译模型和搜索算法。
6. Joint Modeling of Multiple Social Networks to Elucidate Primate Social Dynamics: I. Maximum Entropy Principle and Network-Based Interactions [O] . Stephanie Chan, Hsieh Fushing, Brianne A. Beisner, 2010

机译：多个社交网络的联合模型，以阐明灵长类动物的社会动力学：I.最大熵原理和基于网络的互动
7. Enriching the knowledge sources used in a maximum entropy part-of-speech tagger [O] . Kristina Toutanova, Christopher D. Manning 2000

机译：丰富最大熵词性标注器中使用的知识源

Part-of-Speech Tagger Based on Maximum Entropy Model

摘要

著录项

相似文献

相关主题

期刊订阅