Semi-supervised Learning for Mongolian Morphological Segmentation



Abstract

Unlike previous Mongolian morphological segmentation methods, which rely on large labeled training sets or on complex rules devised by linguists, we explore a novel semi-supervised method aimed at a practical application, statistical machine translation (SMT), in a low-resource learning setting where a small amount of labeled data and a large amount of unlabeled data are available. First, a CRF-based supervised model trained on the small labeled set predicts morpheme boundaries. Then, a lexicon-based segmentation model, which uses the small labeled set as heuristic information, draws on the abundant unlabeled data to compensate for the weaknesses of the first step. Finally, we present several error-correction models that revise the segmentation results. Experimental results show that our method improves segmentation over purely supervised learning. In addition, we integrate the morphological segmentation results into Chinese-Mongolian SMT and achieve satisfactory performance compared with the baseline.
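
The first step of the abstract, predicting morpheme boundaries with a CRF, is commonly framed as per-character sequence labeling. The sketch below illustrates that framing only. The B/I label scheme, the feature names, and the toy word "stemsfx" are assumptions made for illustration; the abstract does not say which labels or features the authors used, and no CRF is trained here.

```python
def to_boundary_labels(morphemes):
    """Map a segmented word to per-character labels.

    'B' marks the first character of a morpheme, 'I' any later character.
    (Assumed label scheme, not taken from the paper.)
    """
    labels = []
    for morpheme in morphemes:
        if not morpheme:
            continue  # ignore empty pieces so every label matches a character
        labels.extend(["B"] + ["I"] * (len(morpheme) - 1))
    return labels


def from_boundary_labels(word, labels):
    """Rebuild the morpheme list from a word and its per-character labels."""
    morphemes = []
    for ch, label in zip(word, labels):
        if label == "B" or not morphemes:
            morphemes.append(ch)   # start a new morpheme
        else:
            morphemes[-1] += ch    # extend the current morpheme
    return morphemes


def char_features(word, i):
    """Hypothetical context features for character i, in the dict form
    that CRF toolkits typically accept."""
    return {
        "char": word[i],
        "prev": word[i - 1] if i > 0 else "<s>",
        "next": word[i + 1] if i < len(word) - 1 else "</s>",
        "dist_to_end": len(word) - 1 - i,
    }


# Toy example with a made-up word, not real Mongolian data.
segmented = ["stem", "sfx"]
word = "".join(segmented)
labels = to_boundary_labels(segmented)
restored = from_boundary_labels(word, labels)
```

Under this framing, a trained tagger would emit one label per character, and `from_boundary_labels` would turn its output back into morphemes that later steps (lexicon-based segmentation, error correction) could revise.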


Bibliographic record

Source
China National Conference on Computational Linguistics | 2016 | 460p | 10 pages
Authors
Zhenxin Yang; Miao Li; Lei Chen; Weihui Zeng; Yi Gao; Sha Fu
Original format
PDF
Chinese Library Classification
TP18-53
Keywords
Semi-supervised learning; Morphological segmentation; Statistical machine translation; Low-resource language


