Question classification plays an important role in cross-language question answering (CLQA) systems, while question Informer plays a key role in enhancing question classification for factual question answering. In this paper, we propose an integrated Genetic Algorithm (GA) and Machine Learning (ML) approach for question classification in English-Chinese cross-language question answering. To enhance question informer prediction, we use a hybrid method that integrates GA and Conditional Random Fields (CRF) to optimize feature subset selection in a CRF-based question informer prediction model. The proposed approach extends cross-language question classification by using the GA-CRF question informer feature with Support Vector Machines (SVM). The results of evaluations on the NTCIR-6 CLQA question sets demonstrate the efficacy of the approach in improving the accuracy of question classification in English-Chinese cross-language question answering.
展开▼