首页> 外文期刊>ACM transactions on Asian language information processing >Word Segmentation for Burmese Based on Dual-Layer CRFs
【24h】

Word Segmentation for Burmese Based on Dual-Layer CRFs

机译:基于双层CRF的缅甸语分词

获取原文
获取原文并翻译 | 示例

摘要

Burmese is an isolated language, in which the syllable is the smallest unit. Syllable segmentation methods based on matching lead to performance subject to the syllable segmentation effect. This article proposes a word segmentation method with fusion conditions of double syllable features. It combines word segmentation and segmentation of syllables into one process, thus reducing the impact of errors on the syllable segmentation of Burmese. In the first layer of the conditional random fields (CRF) model, Burmese characters as atomic features are integrated into the Burma section of the Barkis Speech Paradigm (Backus normal form) features to realize the Burma syllable sequence tags. In the second layer of the CRFs model, with the syllable marked as input, it realizes the sequence markers through building a feature template with syllables as atomic features. The experimental results show that the proposed method has a better effect compared with the method based on the matching of syllables.
机译:缅甸语是一种孤立的语言,其中的音节是最小的单元。基于匹配的音节分割方法会导致性能受到音节分割效果的影响。本文提出了一种具有双音节特征融合条件的分词方法。它将单词分割和音节分割合并为一个过程,从而减少了错误对缅甸语音节分割的影响。在条件随机场(CRF)模型的第一层中,作为原子特征的缅甸字符被集成到Barkis语音范式(Backus正常形式)特征的缅甸部分中,以实现缅甸音节序列标签。在CRF模型的第二层中,将音节标记为输入,它通过构建以音节为原子特征的特征模板来实现序列标记。实验结果表明,与基于音节匹配的方法相比,该方法具有更好的效果。

著录项

  • 来源
  • 作者单位

    Kunming Univ Sci & Technol, Sch Informat Engn & Automat, Kunming 650500, Yunnan, Peoples R China;

    Kunming Univ Sci & Technol, Sch Informat Engn & Automat, Kunming 650500, Yunnan, Peoples R China;

    Kunming Univ Sci & Technol, Sch Informat Engn & Automat, Kunming 650500, Yunnan, Peoples R China;

    Kunming Univ Sci & Technol, Sch Informat Engn & Automat, Kunming 650500, Yunnan, Peoples R China;

    Kunming Univ Sci & Technol, Sch Informat Engn & Automat, Kunming 650500, Yunnan, Peoples R China;

    Kunming Univ Sci & Technol, Sch Informat Engn & Automat, Kunming 650500, Yunnan, Peoples R China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Burmese; word segmentation; CRFs; BNF; syllable segmentation;

    机译:缅甸语;分词;CRFs;BNF;音节分词;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号