摘要:
为了提升汉语词汇测试的命题效率,该文从汉语语言特性和二语教学需求出发,对词语听力、多空词语选择、词语排序和单空词语选择四种词汇测试题型进行自动命题尝试,以满足不同语言信息、不同难度的词汇知识考查.在词语特征的提取上,构建了一个覆盖词音、词形、词义、语法、搭配、偏误各层次信息的词汇知识库,在句子特征的提取上,实现了语法项目自动识别、句子难度分析等算法,为自动命题中的题干句、目标词和干扰项选择提供依据.通过词句选择和语块合成等步骤,生成四种题型共计7 263道词汇测试题.人工测试数据显示,词汇测试自动命题的初步尝试取得了较好的效果,约58%的试题被评价为完全合理,经人工简单调整,试题接受率达到75.7%.%This paper discusses the automatic generation strategy of four types of vocabulary test questions:word listening,multi-word selection,word order and single word selection..A knowledge base is built to extract word-level features including pronunciation,senses,grammars,collocations,learners' errors,etc.Sentence analysis modules are also developed for automatic identification of grammatical constructions and the estimation of sentence difficulty degrees.By selecting proper sentences,target words and distractors,7263 vocabulary test questions are automatically generated in the experiment.The manual evaluation shows that the automatic generation strategy performs well with 58% of the questions evaluated as completely reasonable.After slight manual modification,the question acceptance rate is increased to 75.7%.