Module for processing an audio-video stream associating the spoken words with the corresponding faces

Abstract

The invention relates to a module for processing an audio-video stream, comprising: an audio segmentation sub-module (31) capable of identifying human speech in the audio data of said audio-video stream and of separating speech uttered simultaneously by different people; a face detection sub-module (32) capable of identifying the human faces present in the video data of said audio-video stream; and an association sub-module (33) capable of associating the words spoken by each person with that person's face; characterized in that said association sub-module (33) implements an algorithm using a deep-learning neural network previously trained on a training dataset. The invention also relates to a voice-controlled personal digital assistant for a motor vehicle incorporating such a processing module, as well as a method for detecting voice control requests in a motor vehicle and applying them in a contextualized manner. Figure to be published with the abstract: Fig. 1
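
The three-part structure described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not disclose the network architecture, its inputs, or its outputs. The names `SpeechSegment`, `Face`, `Scorer`, and `associate` are hypothetical, and the scoring function is only a stand-in for the trained deep-learning network of sub-module (33). The sketch assumes that the network can be reduced to a score for each (speech segment, face) pair and that each segment is assigned to the highest-scoring face.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass(frozen=True)
class SpeechSegment:
    """Hypothetical output of the audio segmentation sub-module (31)."""
    segment_id: int
    start_s: float   # segment start time in seconds
    end_s: float     # segment end time in seconds
    text: str        # recognized words


@dataclass(frozen=True)
class Face:
    """Hypothetical output of the face detection sub-module (32)."""
    face_id: int
    box: Tuple[int, int, int, int]  # bounding box as (x, y, width, height)


# Assumption: the trained network is abstracted as a function that returns
# a compatibility score for one speech segment and one detected face.
Scorer = Callable[[SpeechSegment, Face], float]


def associate(
    segments: List[SpeechSegment],
    faces: List[Face],
    score: Scorer,
) -> Dict[int, int]:
    """Sketch of the association step (33): map each segment_id to the
    face_id with the highest score. Returns an empty mapping when no
    face was detected."""
    mapping: Dict[int, int] = {}
    if not faces:
        return mapping
    for segment in segments:
        best_face = max(faces, key=lambda face: score(segment, face))
        mapping[segment.segment_id] = best_face.face_id
    return mapping
```

In use, `score` would wrap whatever model produces the pairwise scores. Overlapping segments (for example, two occupants speaking at once) are handled independently, so each can map to a different face.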

Bibliographic data

  • Publication number: FR3103598A1

  • Publication date: 2021-05-28

  • Original format: PDF

  • Applicant/patentee: PSA AUTOMOBILES SA

  • Application number: FR1913006

  • Inventors: THIBAULT FOUQUERAY; THOMAS HANNAGAN

  • Filing date: 2019-11-21

  • Classification: G06K9; G10L15/24; G10L17/02; G06N3/02; B60W50/10

  • Country: FR

  • Added to database: 2022-08-24 19:02:22
