A method is provided for correcting a dictionary used in a voice processing apparatus. The method includes first extracting speech of a target speaker from audio collected by a microphone, and estimating a speech phonemic sequence constituting the speech. The method also includes calculating, using a first dictionary, a match degree between the speech phonemic sequence and a first phonemic sequence corresponding to a first word registered in the first dictionary, and second extracting the first word having the highest match degree as the spoken word spoken by the target speaker. The method further includes first correcting a second dictionary based on the highest match degree, the second dictionary indicating a relation between a second word and a third word, and second correcting the second dictionary by correcting the relation between the second word and the third word that matches the spoken word.
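
The match-degree calculation and the extraction of the spoken word might be sketched as follows. This is a minimal illustration, not the claimed implementation: the abstract does not define the match degree, so the sketch assumes a similarity ratio from Python's `difflib.SequenceMatcher`, and the function names, phoneme labels, and dictionary entries are all hypothetical. The two correction steps on the second dictionary are left out because the abstract does not state the update rule.

```python
from difflib import SequenceMatcher


def match_degree(speech_phonemes, word_phonemes):
    """Similarity in [0, 1] between two phoneme sequences.

    Assumption: the abstract does not define the match degree, so a
    generic sequence-similarity ratio stands in for it here.
    """
    matcher = SequenceMatcher(None, speech_phonemes, word_phonemes, autojunk=False)
    return matcher.ratio()


def extract_spoken_word(speech_phonemes, first_dictionary):
    """Return (word, degree) for the registered word with the highest match degree."""
    scored = {
        word: match_degree(speech_phonemes, phonemes)
        for word, phonemes in first_dictionary.items()
    }
    best_word = max(scored, key=scored.get)
    return best_word, scored[best_word]


# Hypothetical first dictionary: registered word -> phoneme sequence.
first_dictionary = {
    "hello": ["HH", "AH", "L", "OW"],
    "yellow": ["Y", "EH", "L", "OW"],
    "help": ["HH", "EH", "L", "P"],
}

# Hypothetical phoneme sequence estimated from the target speaker's speech.
speech = ["HH", "AH", "L", "OW"]

spoken_word, highest_match_degree = extract_spoken_word(speech, first_dictionary)
```

In this sketch the highest match degree is returned alongside the spoken word, since the abstract uses that value as the basis for the first correction of the second dictionary.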