首页> 外文会议>International Conference on Asian Language Processing >The Comparative Research on the Segmentation Strategies of Tibetan Bounded-Variant Forms
【24h】

The Comparative Research on the Segmentation Strategies of Tibetan Bounded-Variant Forms

机译:藏有界型形式分割策略的比较研究

获取原文

摘要

The segmentation of Tibetan bounded-variant forms (TBVFS) is one of the most foundational tasks in text processing and the segmenting results directly influence the word segmentation, portaging, syntactic parsing and the Named Entity Extraction and so on. At present, the segmenting results are unsatisfactory and cannot be applied in practice. In this article, authors firstly describe the features of TBVFS, their distributions and then test the segmenting results by using two different segmentation strategies and conclude that Statistics-based methods for morpheme position tagging is better than Rule-based methods. If some rules are used to adjust a part of mistaken segmentations in the post processing, this kind of segmentation problem can be resolved.
机译:西藏有界 - 变体形式(TBVF)的分割是文本处理中最大的基础任务之一,分段结果直接影响单词分割,移植,句法解析和命名实体提取等。 目前,分段结果不令人满意,不能在实践中应用。 在本文中,作者首先描述了TBVFS,其分布的特征,然后通过使用两个不同的分割策略来测试分段结果,并得出结论,语音位置标记的基于统计数据的方法优于基于规则的方法。 如果某些规则用于调整后处理中的错误分段部分,则可以解决这种分割问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号