Conditional Random Fields for Korean Morpheme Segmentation and POS Tagging

SEUNG-HOON NA

首页> 外文期刊>ACM transactions on Asian language information processing >Conditional Random Fields for Korean Morpheme Segmentation and POS Tagging

【24h】

Conditional Random Fields for Korean Morpheme Segmentation and POS Tagging

机译：用于韩国语词素分割和POS标记的条件随机字段

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

There has been recent interest in statistical approaches to Korean morphological analysis. However, previous studies have been based mostly on generative models, including a hidden Markov model (HMM), without utilizing discriminative models such as a conditional random field (CRF). We present a two-stage discriminative approach based on CRFs for Korean morphological analysis. Similar to methods used for Chinese, we perform two disambiguation procedures based on CRFs: (1) morpheme segmentation and (2) POS tagging. In morpheme segmentation, an input sentence is segmented into sequences of morphemes, where a morpheme unit is either atomic or compound. In the POS tagging procedure, each morpheme (atomic or compound) is assigned a POS tag. Once POS tagging is complete, we carry out a post-processing of the compound morphemes, where each compound morpheme is further decomposed into atomic morphemes, which is based on pre-analyzed patterns and generalized HMMs obtained from the given tagged corpus. Experimental results show the promise of our proposed method.

机译：最近对韩语形态分析的统计方法产生了兴趣。但是，以前的研究主要基于生成模型，包括隐马尔可夫模型（HMM），而没有利用诸如条件随机场（CRF）之类的判别模型。我们提出了一种基于CRF的两阶段判别方法，用于韩国形态分析。类似于用于中文的方法，我们基于CRF执行两个消歧过程：（1）语素分割和（2）POS标记。在语素分割中，将输入句子分割成语素序列，其中语素单元是原子或化合物。在POS标记过程中，为每个语素（原子或化合物）分配了POS标记。 POS标记完成后，我们将对复合词素进行后处理，其中每个复合词素将进一步分解为原子词素，这是基于预先分析的模式和从给定标记语料库中获得的广义HMM的结果。实验结果表明了我们提出的方法的前景。

著录项

来源
《ACM transactions on Asian language information processing》 |2015年第3期|10.1-10.16|共16页
作者
SEUNG-HOON NA;
展开▼
作者单位

Busan University of Foreign Studies;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Conditional random fields; morpheme segmentation; POS tagging; Korean morphological analysis;

机译：条件随机字段;语素分割POS标签;韩国形态分析;

相似文献

外文文献
中文文献
专利

1. Phrase-Based Statistical Model for Korean Morpheme Segmentation and POS Tagging [J] . Seung-Hoon NA, Young-Kil KIM IEICE transactions on information and systems . 2018,第2期

机译：基于短语的韩国语词素分割和POS标签统计模型
2. Syllable-Pattern-Based Unknown-Morpheme Segmentation and Estimation for Hybrid Part-of-Speech Tagging of Korean [J] . Gary Geunbae Lee, Jeongwon Cha, Jong-Hyeok Lee Computational linguistics . 2002,第1期

机译：基于音节模式的未知语素分割和韩语混合词性标注的估计
3. A label fusion method using conditional random fields with higher-order potentials: Application to hippocampal segmentation [J] . Platero Carlos, Carmen Tobar M. Artificial intelligence in medicine . 2015,第2期

机译：使用具有更高阶电位的条件随机场的标签融合方法：在海马分割中的应用
4. POS Tagging in Amazighe Using Support Vector Machines and Conditional Random Fields [C] . Mohamed Outahajala, Yassine Benajiba, Paolo Rosso, Natural language processing and information systems . 2011

机译：使用支持向量机和条件随机场的Amazighe POS标记
5. A Semi-Automated Approach to Medical Image Segmentation using Conditional Random Field Inference [D] . ?Hu, Yu-chi 2020

机译：使用条件随机场推断进行半自动方法的医学图像分割方法
6. Left ventricular segmentation from MRI datasets with edge modelling conditional random fields [O] . Janto F Dreijer, Ben M Herbst, Johan A du Preez 2013

机译：带有边缘建模条件随机场的MRI数据集的左心室分割
7. Syllable-Pattern-Based Unknown- Morpheme Segmentation and Estimation for Hybrid Part-of-Speech Tagging of Korean [O] . Gary Geunbae, Lee Jeongwon Cha, Jong-hyeok Lee 2008

机译：基于音节模式的未知语素分割与韩语混合词性标注的估计

Conditional Random Fields for Korean Morpheme Segmentation and POS Tagging

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅