Automatic identification and annotation of fixed phrases is a foundation and prerequisite for Mongolian text processing. Basic research tasks such as part-of-speech tagging, phrase tagging, syntactic parsing, semantic classification, and semantic role labeling, as well as applications such as machine translation and text proofreading, all operate on text in which fixed phrases have been correctly annotated. Building on the "Mongolian Fixed Phrase Grammatical Information Dictionary", this paper designs and implements an algorithm for recognizing and labeling Mongolian fixed phrases based on finite state automata and rules. Experiments show a recognition rate above 90% and an average processing time of 0.0050 ms per word, a considerable improvement over an algorithm based on string matching.
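The abstract does not give the dictionary format or the rules, so the following is only a minimal sketch of the general idea of dictionary-driven finite-state matching, not the authors' algorithm. It assumes a phrase dictionary compiled into a word-level trie that acts as a deterministic automaton, with a leftmost-longest match policy. The class name `PhraseAutomaton`, the labels `FP_A` and `FP_B`, and the placeholder tokens `w0` to `w4` are all hypothetical.

```python
from typing import Dict, List, Optional, Tuple


class PhraseAutomaton:
    """Word-level trie used as a deterministic finite automaton (illustrative)."""

    def __init__(self) -> None:
        # transitions[s] maps an input word to the index of the next state.
        self.transitions: List[Dict[str, int]] = [{}]
        # accept[s] holds the label of the phrase ending at state s, if any.
        self.accept: List[Optional[str]] = [None]

    def add_phrase(self, words: List[str], label: str) -> None:
        """Insert one dictionary entry, creating states as needed."""
        state = 0
        for word in words:
            nxt = self.transitions[state].get(word)
            if nxt is None:
                nxt = len(self.transitions)
                self.transitions.append({})
                self.accept.append(None)
                self.transitions[state][word] = nxt
            state = nxt
        self.accept[state] = label

    def annotate(self, tokens: List[str]) -> List[Tuple[int, int, str]]:
        """Return (start, end, label) spans, scanning left to right and
        keeping the longest dictionary phrase that starts at each position."""
        spans: List[Tuple[int, int, str]] = []
        i = 0
        while i < len(tokens):
            state = 0
            best: Optional[Tuple[int, str]] = None
            j = i
            while j < len(tokens):
                nxt = self.transitions[state].get(tokens[j])
                if nxt is None:
                    break  # no transition: the automaton rejects from here
                state = nxt
                j += 1
                label = self.accept[state]
                if label is not None:
                    best = (j, label)  # remember the longest accepted prefix
            if best is None:
                i += 1  # no phrase starts at this token
            else:
                end, label = best
                spans.append((i, end, label))
                i = end  # resume scanning after the matched phrase
        return spans


# Placeholder tokens stand in for Mongolian words (hypothetical data).
automaton = PhraseAutomaton()
automaton.add_phrase(["w1", "w2"], "FP_A")
automaton.add_phrase(["w1", "w2", "w3"], "FP_B")
print(automaton.annotate(["w0", "w1", "w2", "w3", "w1", "w2", "w4"]))
```

Each token triggers at most one hash lookup per automaton step, which is one plausible reason a finite-state approach can outperform repeated string matching against every dictionary entry. Rule-based handling, such as resolving ambiguous or inflected forms, would sit on top of this matching step and is not modeled here.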