Provided are an apparatus for summarizing a song in MP3 (MPEG-1 (Moving Picture Experts Group-1) Layer 3) format by using its musical structure, a method therefor, and a recording medium storing a program that implements the method. The invention generalizes and defines a typical structure for MP3 songs, detects changes in that structure, and automatically generates a summary of the MP3 sound source. An operation controller (201) receives an MP3 sound source, divides it into granules of a fixed time unit on the basis of the digital values of the sound source, groups the granules into segments of a fixed time unit, supplies the grouped data for processing, and controls output of the summary sound source formed according to the time setting chosen by a user. A summary generator (205) generates per-item feature values for each granule of the sound source, generates a feature vector for each segment on the basis of those feature values, extracts an introduction section, a verse section, and a chorus section of the sound source on the basis of the feature vectors, and generates the summary sound source on the basis of a combination of the sections and the summary time selected by the user.
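The granule and segment stages described above can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes the sound source has already been decoded to a list of PCM sample values; the abstract does not name the feature items, so short-time energy and zero-crossing rate are stand-ins; the segment vector is taken as the per-item mean over its granules; and the function names and the number of granules per segment are hypothetical. The granule length of 576 samples is the MPEG-1 Layer 3 granule size.

```python
from statistics import mean

GRANULE_SIZE = 576        # samples per granule in MPEG-1 Layer 3
GRANULES_PER_SEGMENT = 4  # hypothetical value; the abstract only says "a certain time unit"


def split_granules(samples, size=GRANULE_SIZE):
    """Split decoded samples into full granules; a trailing partial granule is dropped."""
    return [samples[i:i + size] for i in range(0, len(samples) - size + 1, size)]


def granule_features(granule):
    """Return (energy, zero-crossing rate) for one granule.

    Both feature items are illustrative assumptions, not taken from the source.
    """
    energy = sum(x * x for x in granule) / len(granule)
    crossings = sum(1 for a, b in zip(granule, granule[1:]) if (a < 0) != (b < 0))
    return (energy, crossings / (len(granule) - 1))


def segment_vectors(features, per_segment=GRANULES_PER_SEGMENT):
    """Average each feature item over consecutive groups of granules (assumed aggregation)."""
    vectors = []
    for i in range(0, len(features) - per_segment + 1, per_segment):
        group = features[i:i + per_segment]
        vectors.append(tuple(mean(column) for column in zip(*group)))
    return vectors
```

The resulting list of segment vectors would be the input to section extraction (introduction, verse, chorus), which this sketch does not attempt because the abstract does not specify how the sections are detected.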