首页>
外国专利>
Module for processing an audio-video stream associating the spoken words with the corresponding faces
Module for processing an audio-video stream associating the spoken words with the corresponding faces
展开▼
机译:用于处理与相应面部说出口语的音频视频流的模块
展开▼
页面导航
摘要
著录项
相似文献
摘要
The invention relates to a module for processing an audio-video stream comprising: an audio segmentation sub-module (31) capable of identifying human words in the audio data of said audio-video stream and of dissociating those spoken simultaneously by different people; - a face detection submodule (32) capable of identifying the human faces present in the video data of said audio-video stream; and- an association sub-module (33) capable of associating the words spoken by each person with their face; characterized in that said association sub-module (33) implements an algorithm using a neural network of deep learning previously trained from a learning base. The invention also relates to a personal digital assistant with voice control for a motor vehicle incorporating such a processing module as well as a method for detecting and applying in a contextualized manner voice control requests in a motor vehicle. Figure to be published with the abstract: Fig. 1
展开▼