This disclosure relates generally to speech signal processing, and more particularly to a method and system for processing a speech signal for emotion identification. The system receives a speech signal as input and, while processing it, extracts a plurality of differential features corresponding to a plurality of frames of the speech signal. The differential features are then compared with an emotion recognition model to identify at least one emotion matching the speech signal, and the at least one emotion is associated with the speech signal.
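The flow described above (framing, differential feature extraction, comparison with a model) can be illustrated with a minimal sketch. The abstract does not define the differential features or the form of the emotion recognition model, so everything specific here is an assumption made for illustration: the per-frame features (log energy and zero-crossing rate), the reading of "differential" as frame-to-frame differences, the nearest-centroid comparison standing in for the model, and all function names, labels, and parameter values.

```python
import math


def split_into_frames(samples, frame_len, hop):
    """Slice the signal into fixed-length frames taken every `hop` samples."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]


def frame_features(frame):
    """Per-frame features (assumed): log energy and zero-crossing rate."""
    energy = sum(x * x for x in frame) / len(frame)
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0))
    return [math.log(energy + 1e-12), crossings / (len(frame) - 1)]


def differential_features(per_frame):
    """Differences between consecutive frames (assumed meaning of 'differential')."""
    return [[cur_k - prev_k for prev_k, cur_k in zip(prev, cur)]
            for prev, cur in zip(per_frame, per_frame[1:])]


def summarize(deltas):
    """Mean absolute delta per feature dimension."""
    n = len(deltas)
    return [sum(abs(d[k]) for d in deltas) / n for k in range(len(deltas[0]))]


def identify_emotion(samples, model, frame_len=400, hop=160):
    """Return the label whose centroid is closest to the signal's summary.

    `model` is a hypothetical stand-in for the emotion recognition model:
    a dict mapping an emotion label to a centroid vector.
    """
    frames = split_into_frames(samples, frame_len, hop)
    if len(frames) < 2:
        raise ValueError("need at least two frames to compute differences")
    deltas = differential_features([frame_features(f) for f in frames])
    summary = summarize(deltas)
    return min(model, key=lambda label: math.dist(summary, model[label]))


# Hypothetical model and input; the centroid values are made up.
toy_model = {"calm": [0.0, 0.0], "agitated": [5.0, 0.5]}
steady_signal = [1.0, -1.0] * 800
label = identify_emotion(steady_signal, toy_model)
```

In this sketch the returned label is what would be associated with the speech signal. A trained classifier over richer features would take the place of the toy centroid lookup in practice.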