Improving part-of-speech tagging using lexicalized HMMs

FERRAN PLA; ANTONIO MOLINA

首页> 外文期刊>Natural language engineering >Improving part-of-speech tagging using lexicalized HMMs

【24h】

Improving part-of-speech tagging using lexicalized HMMs

机译：使用词法化的HMM改进词性标记

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We introduce a simple method to build Lexicalized Hidden Markov Models (L-HMMs) for improving the precision of part-of-speech tagging. This technique enriches the contextual Language Model taking into account a set of selected words empirically obtained. The evaluation was conducted with different lexicalization criteria on the Penn Treebank corpus using the TnT tagger. This lexicalization obtained about a 6% reduction of the tagging error, on an unseen data test, without reducing the efficiency of the system. We have also studied how the use of linguistic resources, such as dictionaries and morphological analyzers, improves the tagging performance. Furthermore, we have conducted an exhaustive experimental comparison that shows that Lexicalized HMMs yield results which are better than or similar to other state-of-the-art part-of-speech tagging approaches. Finally, we have applied Lexicalized HMMs to the Spanish corpus LexEsp.

机译：我们引入了一种简单的方法来构建词法化隐式马尔可夫模型（L-HMM），以提高词性标注的精度。考虑到根据经验获得的一组选定单词，此技术丰富了上下文语言模型。评估是使用TnT标签在Penn Treebank语料库上使用不同的词汇化标准进行的。在看不见的数据测试中，这种词汇化使标记错误减少了大约6％，而不会降低系统的效率。我们还研究了如何使用语言资源（例如词典和词法分析器）来提高标记性能。此外，我们进行了详尽的实验比较，表明Lexicalized HMM产生的结果优于或类似于其他最新的词性标记方法。最后，我们将词法化的HMM应用到了西班牙语料库LexEsp。

著录项

来源
《Natural language engineering》 |2004年第6期|p.167-189|共23页
作者
FERRAN PLA; ANTONIO MOLINA;
展开▼
作者单位

Departament de Sistemes Informatics i Computacio, Universitat Politecnica de Valencia, Cami de Vera, s. 46020 Valencia SPAIN;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. A lexicalized second-order-HMM for ambiguity resolution in Chinese segmentation and POS tagging [J] . Chen Yin, Yang Muyun, Zhao Tiejun High Technology Letters . 2005,第4期

机译：词汇化的二阶HMM用于中文细分和POS标签中的歧义解析
2. span xmlns="http://www.wiley.com/namespaces/wiley" cssStyle="font-family:monospace">HMMoce/span>HMMoce : An span xmlns="http://www.wiley.com/namespaces/wiley" cssStyle="font-family:monospace">R/span>R package for improved geolocation of archival‐tagged fishes using a hidden Markov method [J] . Braun Camrin D., Galuardi Benjamin, Thorrold Simon R., Methods in Ecology and Evolution . 2018,第5期

机译：＆ span xmlns =“http://www.wiley.com/namespaces/wiley”cssstyle =“font-family：monospace”> hmmoce＆ / span> hmmoce：一个＆ span xmlns =“http：// www。 wiley.com/namespaces/wiley“cssstyle =”font-family：monospace“> r＆ / span> R包，用于使用隐藏的马尔可夫方法改进档案标记的鱼类的地理位置
3. Improving accuracy of Part-of-Speech (POS) tagging using hidden markov model and morphological analysis for Myanmar Language [J] . Dim Lam Cing, Khin Mar Soe International Journal of Electrical and Computer Engineering . 2020,第2期

机译：使用隐马尔可夫模型和缅甸语言的形态分析提高语音部分（POS）标记的准确性
4. A hybrid PSO-Viterbi algorithm for HMMs parameters weighting in Part-of-Speech tagging [C] . Sun Shichang, Lin Hongfei, Liu Hongbo 2011 International Conference of Soft Computing and Pattern Recognition . 2011

机译：语音部分标记中HMM参数加权的混合PSO-Viterbi算法
5. IITagger: Tagging Wall Street Journal text with part-of-speech information [D] . Kim, Yeongkwun 1996

机译：IITagger：使用词性信息标记“华尔街日报”文本
6. Improving performance of natural language processing part-of-speech tagging on clinical narratives through domain adaptation [O] . Jeffrey P Ferraro, Hal Daumé III, Scott L DuVall, 2013

机译：通过领域适应提高自然语言处理词性标注在临床叙事上的表现
7. Improving a simple bigram hmm part-of-speech tagger by latent annotation and selftraining [O] . Zhongqiang Huang, Vladimir Eidelman, Mary Harper 2009

机译：通过潜在注释和自我训练改进简单的bigram hmm词性标记器

Improving part-of-speech tagging using lexicalized HMMs

摘要

著录项

相似文献

相关主题

期刊订阅