...
首页> 外文期刊>Applied Computer Systems >Analysing the Methods of Dzongkha Word Segmentation
【24h】

Analysing the Methods of Dzongkha Word Segmentation

机译:宗喀语分词方法分析

获取原文

摘要

In both Chinese and Dzongkha languages, the greatest challenge is to identify the word boundaries because there are no word delimiters as it is in English and other Western languages. Therefore, preprocessing and word segmentation is the first step in Dzongkha language processing, such as translation, spell-checking, and information retrieval. Research on Chinese word segmentation was conducted long time ago. Therefore, it is relatively mature, but the Dzongkha word segmentation has been less studied by researchers. In the paper, we have investigated this major problem in Dzongkha language processing using a probabilistic approach for selecting valid segments with probability being computed on the basis of the corpus.
机译:在中文和宗卡语中,最大的挑战是识别单词边界,因为没有英语和其他西方语言中的单词分隔符。因此,预处理和分词是宗喀语语言处理(例如翻译,拼写检查和信息检索)的第一步。汉语分词的研究很久以前就进行了。因此,它比较成熟,但是研究人员对宗喀语分词的研究较少。在本文中,我们研究了宗喀语语言处理中的这一主要问题,使用概率方法来选择有效句段,并根据语料库计算出概率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号