There is provided a call voice processing apparatus including an input correction unit that corrects characteristics of a first input sound input from a first input apparatus to characteristics of a second input sound input from a second input apparatus, a sound separation unit that separates the second input sound into a plurality of sounds, a sound type estimation unit that estimates sound types of the plurality of sounds separated by the sound separation unit, a mixing ratio calculation unit that calculates a mixing ratio of each sound in accordance with the sound type estimated by the sound type estimation unit, a sound mixing unit that mixes the plurality of sounds separated by the sound separation unit in the mixing ratio calculated by the mixing ratio calculation unit, and an extraction unit that extracts a specific sound from the first input sound corrected by the input correction unit.
展开▼