首页> 外文会议>International Conference on Statistical Language and Speech Processing >Unsupervised Morphological Segmentation Using Neural Word Embeddings
【24h】

Unsupervised Morphological Segmentation Using Neural Word Embeddings

机译:使用神经词嵌入的无监督的形态分割

获取原文

摘要

We present a fully unsupervised method for morphological segmentation. Unlike many morphological segmentation systems, our method is based on semantic features rather than orthographic features. In order to capture word meanings, word embeddings are obtained from a two-level neural network [11]. We compute the semantic similarity between words using the neural word embeddings, which forms our baseline segmentation model. We model morphotactics with a bigram language model based on maximum likelihood estimates by using the initial segmentations from the baseline. Results show that using semantic features helps to improve morphological segmentation especially in agglutinating languages like Turkish. Our method shows competitive performance compared to other unsupervised morphological segmentation systems.
机译:我们介绍了一种完全无监督的形态细分方法。与许多形态分割系统不同,我们的方法基于语义特征而不是正交特征。为了捕获词含义,从两级神经网络获得单词嵌入式[11]。我们使用神经单词嵌入式计算单词之间的语义相似性,这构成了我们的基线分段模型。我们使用基于基线的初始分段,使用基于最大似然估计的Bigram语言模型来模拟Morphoticics。结果表明,使用语义特征有助于改善形态分割,尤其是土耳其等凝聚语言。与其他无监督的形态分割系统相比,我们的方法显示了竞争性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号