Using Stanford Part-of-Speech Tagger for the Morphologically-rich Filipino Language

Abstract

This research focuses on the implementation of a Maximum Entropy-based Part-of-Speech (POS) tagger for Filipino. It uses the Stanford POS tagger, a trainable POS tagger that has been trained on English, Chinese, Arabic, and other languages and produces some of the highest results in each of them. The tagger was trained for Filipino on a 406k-token corpus, taking into account linguistic phenomena characteristic of Filipino such as rich morphology and intra-sentential code-switching. The resulting Filipino POS tagger achieved 96.15% tagging accuracy, which is currently the highest among existing POS taggers for Filipino, by a large margin.
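The abstract reports its result as tagging accuracy. As a minimal sketch, assuming the usual token-level definition (correctly tagged tokens divided by all tokens), the metric can be computed as below. The function name, the sample sentence, and the tag labels are illustrative assumptions; they are not taken from the paper or from its tagset.

```python
from typing import List, Tuple

# A tagged sentence is a list of (token, tag) pairs.
TaggedSentence = List[Tuple[str, str]]


def tagging_accuracy(gold: List[TaggedSentence],
                     predicted: List[TaggedSentence]) -> float:
    """Return the fraction of tokens whose predicted tag matches the gold tag."""
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must contain the same number of sentences")

    correct = 0
    total = 0
    for gold_sent, pred_sent in zip(gold, predicted):
        if len(gold_sent) != len(pred_sent):
            raise ValueError("sentence lengths differ; tokenization must match")
        for (_, gold_tag), (_, pred_tag) in zip(gold_sent, pred_sent):
            total += 1
            if gold_tag == pred_tag:
                correct += 1

    if total == 0:
        raise ValueError("no tokens to evaluate")
    return correct / total


# Hypothetical example: the tags are generic placeholders, not the paper's tagset.
gold_example = [[("Kumain", "VERB"), ("ako", "PRON"), ("ng", "PART"), ("mangga", "NOUN")]]
predicted_example = [[("Kumain", "VERB"), ("ako", "PRON"), ("ng", "NOUN"), ("mangga", "NOUN")]]

# Three of the four tokens carry the correct tag.
print(tagging_accuracy(gold_example, predicted_example))
```

Under this definition, 96.15% accuracy means that roughly 96 of every 100 tokens in the evaluation data received the gold-standard tag.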

Publication details

Source: Pacific Asia Conference on Language, Information and Computation, 2017, pp. 81-88 (8 pages)
Authors: Matthew Phillip V. Go; Nicco Nocon
Original format: PDF

Similar literature

1. Exploiting languages proximity for part-of-speech tagging of three French regional languages [J]. Magistry Pierre, Ligozat Anne-Laure, Rosset Sophie. Language Resources and Evaluation, 2019, issue 4.
2. Novel Text Steganography Using Natural Language Processing and Part-of-Speech Tagging [J]. Banik Barnali Gupta, Bandyopadhyay Samir Kumar. IETE Journal of Research, 2020, issue 3.
3. Improving accuracy of Part-of-Speech (POS) tagging using hidden Markov model and morphological analysis for Myanmar Language [J]. Dim Lam Cing, Khin Mar Soe. International Journal of Electrical and Computer Engineering, 2020, issue 2.
4. Using Stanford Part-of-Speech Tagger for the Morphologically-rich Filipino Language [C]. Matthew Phillip V. Go, Nicco Nocon. Pacific Asia Conference on Language, Information and Computation, 2018.
5. Reducing pipeline error propagation in natural language processing: Part-of-speech tagging applied to clinical narratives [D]. Ferraro, Jeffrey Page. 2013.
6. Improving performance of natural language processing part-of-speech tagging on clinical narratives through domain adaptation [O]. Jeffrey P Ferraro, Hal Daumé III, Scott L DuVall. 2013.
7. Weakly supervised part-of-speech tagging for morphologically-rich, resource-scarce languages [O]. Kazi Saidul Hasan, Vincent Ng. 2009.
