A pitch frequency is a most typical parameter in a speech signal. So far, many methods extracting the pitch have been proposed. However, it is considered that there is not a complete pitch extraction method. Although it is not a complete, we propose a simple pitch extraction approach in order to apply analysis and synthesis of emotional speech. To express emotional information from "natural voice", pitch pattern control and amplitude control techniques are required. The proposed approach has two stages. The first stage finds roughly a frequency band in which the pitch will exist. This band is considered as a candidate that includes the pitch. The detailed frequency of the pitch is obtained in the second stage by DFT analysis of the candidate frequency band. Simulation results show that this approach can get the pitch with a simple procedure and the result of the proposed method is similar to that of the auto-correlation of the residual of a linear prediction.
展开▼