A voice section determination method including determining, for each of a plurality of sound frames, whether each of the plurality of sound frames corresponds to an utterance section, calculating a background noise for a target sound frame in the plurality of sound frames based on the plurality of sound frames prior to the target sound frame, the plurality of sound frames being included in a silence section, calculating a signal-to-noise ratio by using the calculated background noise, determining which does the target sound frame correspond to a first sound section of a first sound, or a second sound section of a second sound, the second sound being generated by transforming the first sound, and when the target sound frame is determined to correspond to the first sound section, determining whether the target sound frame corresponds to a voice section based on a pitch gain.
展开▼