Semi-supervised Learning for Mongolian Morphological Segmentation

Abstract

Unlike previous Mongolian morphological segmentation methods, which rely on large labeled training sets or on complex rules formulated by linguists, we explore a novel semi-supervised method for a practical application, i.e., statistical machine translation (SMT), in a low-resource learning setting where a small amount of labeled data and a large amount of unlabeled data are available. First, a supervised model based on conditional random fields (CRF) is trained on the small labeled set to predict morpheme boundaries. Then, a lexicon-based segmentation model, which uses the small labeled set as heuristic information, exploits the abundant unlabeled data to compensate for the weaknesses of the first step. Finally, we present several error correction models to revise the segmentation results. Experimental results show that our method improves segmentation results compared with purely supervised learning. In addition, we integrate the morphological segmentation results into Chinese-Mongolian SMT and achieve satisfactory performance compared with the baseline.
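
The first step in the abstract predicts morpheme boundaries with a CRF. One common way to pose boundary prediction as a sequence-labeling task is to give every character a tag; the sketch below assumes a simple B/I scheme ("B" opens a morpheme, "I" continues it). The abstract does not state which label scheme the authors use, so the scheme, the function names, and the transliterated example word are illustrative assumptions, not the paper's method.

```python
from typing import List


def morphemes_to_tags(morphemes: List[str]) -> List[str]:
    """Encode a segmented word as one tag per character.

    'B' marks the first character of a morpheme, 'I' any later character.
    (Assumed label scheme; the source abstract does not specify one.)
    """
    tags: List[str] = []
    for morpheme in morphemes:
        if not morpheme:
            raise ValueError("morphemes must be non-empty strings")
        tags.append("B")
        tags.extend("I" * (len(morpheme) - 1))
    return tags


def tags_to_morphemes(word: str, tags: List[str]) -> List[str]:
    """Decode per-character tags (e.g. a tagger's output) into morphemes."""
    if len(word) != len(tags):
        raise ValueError("word and tags must have the same length")
    morphemes: List[str] = []
    for char, tag in zip(word, tags):
        # Start a new morpheme on 'B' (or at the very first character).
        if tag == "B" or not morphemes:
            morphemes.append(char)
        else:
            morphemes[-1] += char
    return morphemes


# Hypothetical transliterated word split into three morphemes.
labeled_example = ["nom", "uud", "aas"]
example_tags = morphemes_to_tags(labeled_example)
restored = tags_to_morphemes("".join(labeled_example), example_tags)
```

Under this encoding, the small labeled set becomes (character sequence, tag sequence) pairs for training a tagger, and decoding the tagger's output yields the segmentation that later steps could revise.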

Bibliographic details

Source
China National Conference on Computational Linguistics; International Symposium on Natural Language Processing Based on Naturally Annotated Big Data | 2016 | pp. 143-152 | 10 pages
Authors
Zhenxin Yang; Miao Li; Lei Chen; Weihui Zeng; Yi Gao; Sha Fu
Original format: PDF
Keywords
Semi-supervised learning; Morphological segmentation; Statistical machine translation; Low-resource language

Similar literature

1. A spatio-temporal latent atlas for semi-supervised learning of fetal brain segmentations and morphological age estimation [J]. Dittrich E., Riklin Raviv T., Kasprian G. Medical Image Analysis. 2014, No. 1
2. Uncertainty-aware temporal self-learning (UATS): Semi-supervised learning for segmentation of prostate zones and beyond [J]. Meyer Anneke, Ghosh Suhita, Schindele Daniel. Artificial Intelligence in Medicine. 2021, issue "Juna"
3. Semi-supervised Learning for Mongolian Morphological Segmentation [C]. Zhenxin Yang, Miao Li, Lei Chen. China National Conference on Computational Linguistics. 2016
4. Semi-Supervised Semantic Segmentation in UAV Imagery [D]. Vaidya, Ashlesha. 2020
5. Semi-Supervised Learning in Medical MRI Segmentation: Brain Tissue with White Matter Hyperintensity Segmentation Using FLAIR MRI [O]. ZunHyan Rieu, JeeYoung Kim, Regina EY Kim. 2021
6. Semi-Supervised Learning in Medical Image Segmentation: Brain Tissue with White Matter Hyperintensity Segmentation Using FLAIR Image [O]. ZunHyan Rieu, Donghyeon Kim, JeeYoung Kim. 2021
