首页> 外文会议>2010 International Conference on Asian Language Processing >A Dictionary Mechanism for Chinese Word Segmentation Based on the Finite Automata
【24h】

A Dictionary Mechanism for Chinese Word Segmentation Based on the Finite Automata

机译:基于有限自动机的汉语分词词典机制

获取原文

摘要

Dictionary mechanism is the basis of Chinese word segmentation, and its quality directly affects the speed and efficiency of Chinese word segmentation. In existing dictionary mechanisms, there are such shortages as space wasting, low efficiency, and difficult maintenance, and therefore, how to establish an effective mechanism is an urgent problem for Chinese word segmentation. In this paper, the idea of finite-state automaton is firstly studied, then a new kind of dictionary mechanism is proposed to save space and improve the speed of Chinese word segmentation as possible, and finally, the performances of various dictionary mechanisms are analyzed with theoretical study and experimental comparison. The result shows that compared with other mechanisms, the dictionary mechanism based on finite-state automaton proposed in the paper improves in space complexity and time complexity.
机译:词典机制是中文分词的基础,其质量直接影响中文分词的速度和效率。在现有的词典机制中,存在空间浪费,效率低,维护困难等问题,因此,如何建立有效的机制是汉语分词的迫切问题。本文首先研究了有限状态自动机的思想,然后提出了一种新型的字典机制,以节省空间并尽可能提高中文分词的速度,最后,分析了各种字典机制的性能。理论研究和实验比较。结果表明,与其他机制相比,本文提出的基于有限状态自动机的字典机制在空间复杂度和时间复杂度上均有所提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号