A system for answering multiple choice questions includes at least one processor configured to create a question answering model using a training data set. The system is configured to create a balanced data from the imbalanced training data set. The balancing of the imbalanced training data set is achieved by generating synthetic instances of at least one minority category, among a plurality of categories into which the training data set is categorized.
展开▼