Conference: International Forum on Information Technology and Applications

A Pragmatic Approach to Increase Accuracy of Chinese Word-Segmentation

Abstract

Chinese word segmentation is essential for understanding and processing Chinese natural language, and it is an important component of search engines, text retrieval, speech recognition, and automatic translation. It is challenging because written Chinese has no spaces or other physical markers of word boundaries, and it is often difficult to define what constitutes a word in Chinese. No fully mature, practical Chinese word segmentation system is currently available, particularly with respect to segmentation accuracy. This article presents a pragmatic approach to Chinese word segmentation that aims to increase segmentation accuracy. We introduce a mechanism for combining a hybrid dictionary with a universal dictionary, design a practical data structure, describe the segmentation algorithm, and give test results.
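The abstract does not spell out the algorithm or the data structure, so the following is only a minimal sketch of a generic dictionary-based technique (forward maximum matching), not the authors' method. The function name `segment`, the parameter `max_len`, the use of Python sets, and the choice to treat the two dictionaries as a simple union are all assumptions made for illustration.

```python
def segment(text, hybrid_dict, universal_dict, max_len=4):
    """Forward maximum matching over two dictionaries (illustrative sketch).

    Assumption: a candidate counts as a word if it appears in either
    dictionary; how the paper actually combines them is not stated.
    """
    words = []
    i = 0
    while i < len(text):
        # Try the longest candidate first and shrink until one matches.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            known = candidate in hybrid_dict or candidate in universal_dict
            # A single character is always accepted as a fallback.
            if known or size == 1:
                words.append(candidate)
                i += size
                break
    return words


# Hypothetical toy dictionaries, not taken from the paper.
print(segment("中文分词系统", {"中文分词"}, {"中文", "分词", "系统"}))
# ['中文分词', '系统']

# Greedy matching can pick the wrong boundary, which is why accuracy is hard.
print(segment("研究生命起源", set(), {"研究", "研究生", "生命", "起源"}))
# ['研究生', '命', '起源']
```

The second call shows the boundary ambiguity the abstract refers to: with no spaces in the text, the greedy matcher takes "研究生" (graduate student) and leaves "命" stranded, whereas the intended reading is "研究 / 生命 / 起源" (study the origin of life).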
