The present disclosure relates to a system and method for generating a data set for learning a machine-reading based query response system. To this end, a method for generating learning data includes: performing language processing on a text to be learned; Receiving a set of questions and correct answers related to the text; Specifying a position of a sentence related to the question and a position of a sentence related to the correct answer in the text; And verifying the validity of the set of questions and correct answers based on whether a difference between the position of the sentence related to the question and the position of the sentence related to the correct answer is equal to or greater than a preset value.
展开▼