Chinese words enter into fixed collocation relationships with one another. Based on these relationships, this paper proposes a method for resolving word segmentation ambiguity in Chinese. The method first pre-segments each sentence with both forward maximum matching and backward maximum matching, detects ambiguous words, and tags parts of speech. It then matches the ambiguous words against a word collocation dictionary, or judges whether they form a verb-object collocation, and in this way resolves word ambiguity in documents more accurately. Comparative experiments on word ambiguity detection and word collocation detection show that the proposed method achieves good results.
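The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy vocabulary, the toy collocation set, the function names, the `max_len` window, and the rule of scoring each candidate by counting adjacent word pairs found in the collocation set are all assumptions made for illustration. Part-of-speech tagging and the verb-object judgment are omitted.

```python
def forward_max_match(text, vocab, max_len=4):
    """Greedy segmentation from left to right, longest dictionary word first."""
    words, i = [], 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            # A single character is always accepted as a fallback.
            if size == 1 or piece in vocab:
                words.append(piece)
                i += size
                break
    return words


def backward_max_match(text, vocab, max_len=4):
    """Greedy segmentation from right to left, longest dictionary word first."""
    words, j = [], len(text)
    while j > 0:
        for size in range(min(max_len, j), 0, -1):
            piece = text[j - size:j]
            if size == 1 or piece in vocab:
                words.append(piece)
                j -= size
                break
    words.reverse()
    return words


def collocation_score(words, collocations):
    """Count adjacent word pairs that appear in the collocation set."""
    return sum((a, b) in collocations for a, b in zip(words, words[1:]))


def disambiguate(text, vocab, collocations, max_len=4):
    """Return (segmentation, is_ambiguous) for one sentence."""
    fwd = forward_max_match(text, vocab, max_len)
    bwd = backward_max_match(text, vocab, max_len)
    if fwd == bwd:
        return fwd, False  # both directions agree: no ambiguity detected
    # Assumed rule: keep the candidate with more known collocations.
    # Ties fall back to the backward result (an arbitrary choice here).
    if collocation_score(fwd, collocations) > collocation_score(bwd, collocations):
        return fwd, True
    return bwd, True


# Hypothetical toy data, not taken from the paper.
VOCAB = {"研究", "研究生", "生命", "的", "起源"}
COLLOCATIONS = {("研究", "生命")}

if __name__ == "__main__":
    sentence = "研究生命的起源"
    print(forward_max_match(sentence, VOCAB))
    print(backward_max_match(sentence, VOCAB))
    print(disambiguate(sentence, VOCAB, COLLOCATIONS))
```

In this toy sentence the two matching directions disagree on the span 研究生命, which marks it as ambiguous, and the collocation lookup is what decides between the candidates. A real system would need a full lexicon and a collocation dictionary built from corpus data.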