首页> 外国专利> DETERMINING WHEN A SUBJECT IS SPEAKING BY ANALYZING A RESPIRATORY SIGNAL OBTAINED FROM A VIDEO

DETERMINING WHEN A SUBJECT IS SPEAKING BY ANALYZING A RESPIRATORY SIGNAL OBTAINED FROM A VIDEO

机译：通过分析从视频中获得的呼吸信号来确定对象说话的时间

页面导航

摘要
著录项
相似文献

摘要

What is disclosed is a system and method for determining when a subject is speaking from a respiratory signal obtained from a video of that subject. A video of a subject is received and a respiratory signal is extracted from a time-series signal is obtained from processing pixels in image frames of the video. The respiratory signal comprises an inspiratory signal and an expiratory signal. Cycle-level feature are extracted from the respiratory signal and used to identify expiratory signals during which speech is likely to have occurred. The identified expiratory signal are divided into time intervals. Frame-level features are determined for each time interval and an amount of distortion in the expiratory signal for this time interval is quantified. The amount of distortion is compared to a threshold. In response to the comparison, a determination is made that speech occurred during this interval. The process repeats for all time intervals.

机译：所公开的是一种系统和方法，用于根据从该对象的视频获得的呼吸信号来确定该对象何时在讲话。接收对象的视频，并从时间序列信号中提取呼吸信号，该时间信号是从处理视频图像帧中的像素获得的。呼吸信号包括吸气信号和呼气信号。从呼吸信号中提取出循环水平特征，并将其用于识别可能发生语音的呼气信号。所识别的呼气信号被分为时间间隔。确定每个时间间隔的帧级特征，并量化该时间间隔的呼气信号失真量。将失真量与阈值进行比较。响应于比较，确定在该间隔期间发生了语音。该过程在所有时间间隔内重复。

著录项

公开/公告号US2017294193A1

专利类型
公开/公告日2017-10-12

原文格式PDF
申请/专利权人 XEROX CORPORATION;
展开▼

申请/专利号US201615092287
发明设计人 PRAGATHI PRAVEENA;PRATHOSH ARAGULLA PRASAD;
展开▼

申请日2016-04-06
分类号G10L25/45;G10L25/78;G10L25/09;G06T7/20;
国家 US
入库时间 2022-08-21 13:52:46

相似文献

专利
外文文献
中文文献