首页>
外国专利>
MULTI-SPEAKER DIARIZATION OF AUDIO INPUT USING A NEURAL NETWORK
MULTI-SPEAKER DIARIZATION OF AUDIO INPUT USING A NEURAL NETWORK
展开▼
机译:使用神经网络的音频输入的多扬声器日复速度
展开▼
页面导航
摘要
著录项
相似文献
摘要
An audio analysis platform may receive a portion of an audio input, wherein the audio input corresponds to audio associated with a plurality of speakers. The audio analysis platform may process, using a neural network, the portion of the audio input to determine voice activity of the plurality of speakers during the portion of the audio input, wherein the neural network is trained using reference audio data and reference diarization data corresponding to the reference audio data. The audio analysis platform may determine, based on the neural network being used to process the portion of the audio input, a diarization output associated with the portion of the audio input, wherein the diarization output indicates individual voice activity of the plurality of speakers. The audio analysis platform may provide the diarization output to indicate the individual voice activity of the plurality of speakers during the portion of the audio input.
展开▼