Audio spatio-temporal fingerprints for cloudless real-time hands-free diarization on mobile devices

机译：音频时空指纹可在移动设备上实现无云实时免提区分

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we propose a new low bit rate representation of a sound field and a new method for the corresponding cloudless low delay hands-free diarization suitable for low-performance mobile devices, e.g. mobile phones. The proposed audio spatio-temporal fingerprint representation results in low bit rate (500 bytes/second), however contains complete information about continuous audio tracking of multiple acoustic sources in an open, unconstrained environment. The core of the algorithm is based on simultaneous multiple data stream processing using audio spatio-temporal fingerprint representation to cover higher level events relevant for diarization, e.g. turns, interruptions, crosstalk, speech and non-speech segments. Performance levels achieved to date on 5 hours of hand-labelled datasets have shown the feasibility of the approach at the same time as resulting in 7.58% CPU load on 1-core ultra-low-power mobile processor running at 1 GHz and low algorithmic delay of 112 ms.

机译：在本文中，我们提出了一种新的声场低比特率表示方法，以及一种适用于低性能移动设备（例如，低功耗移动设备）的对应的无云低延迟免提数字化的新方法。手机。所提出的音频时空指纹表示导致较低的比特率（500字节/秒），但是包含有关在开放，不受约束的环境中对多个声源进行连续音频跟踪的完整信息。该算法的核心是基于同时进行的多个数据流处理，该处理使用音频时空指纹表示来覆盖与数字化相关的更高级别的事件，例如：转弯，打断，串扰，语音和非语音片段。迄今为止，在5个小时的手工标记数据集上所达到的性能水平已经证明了该方法的可行性，因为该方法可在运行于1 GHz的1核超低功耗移动处理器上实现7.58％的CPU负载，并且算法延迟低的112毫秒。

著录项

来源
《2011 Joint Workshop on Hands-free Speech Communication and Microphone Arrays》|2011年|p.25-30|共6页
会议地点
作者
Korchagin Danil;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类电声技术和语音信号处理;
关键词
Microphone arrays; array signal processing; mobile computing; source coding;

机译：麦克风阵列;阵列信号处理;移动计算;源代码;

相似文献

外文文献
中文文献
专利

1. ONLINE SPEAKER DIARIZATION FOR MULTIMEDIA DATA RETRIEVAL ON MOBILE DEVICES [J] . KYUNG-MI PARK, JEONG-SIK PARK, JAE-HYUN BAE, International Journal of Pattern Recognition and Artificial Intelligence . 2012,第8期

机译：移动设备上多媒体数据的在线说话人数字化检索
2. Sub-fingerprint masking for a robust audio fingerprinting system in a real-noise environment for portable consumer devices [J] . Consumer Electronics, IEEE Transactions on . 2010,第1期

机译：用于便携式消费类设备的真实噪声环境中的健壮音频指纹识别系统的子指纹屏蔽
3. MONSTER BEGINS SHIPPING NEW MOBILE PRODUCTS Charging and hands-free devices for iPads, iPhones, iPods [J] . Jeff OHeir Dealerscope . 2011,第4期

机译：怪物开始运输新的移动产品用于iPad，iPhone，iPod的充电和免提设备
4. Audio spatio-temporal fingerprints for cloudless real-time hands-free diarization on mobile devices [C] . Korchagin Danil Joint Workshop on Hands-free Speech Communication and Microphone Arrays . 2011

机译：用于无云实时免提升级的音频时空指纹在移动设备上
5. Audio Screen: Unsighted Game Mechanics for Mobile Devices. [D] . Brammer, Jonathon. 2016

机译：音频屏幕：移动设备的无知游戏机制。
6. Method for Reading Sensors and Controlling Actuators Using Audio Interfaces of Mobile Devices [O] . Rafael V. Aroca, Aquiles F. Burlamaqui, Luiz M. G. Gonçalves 2012

机译：使用移动设备的音频接口读取传感器和控制执行器的方法
7. Audio spatio-temporal fingerprints for cloudless real-time hands-free diarization on mobile devices [O] . Danil Korchagin 2011

机译：音频时空指纹，用于移动设备上的无云实时免提分类

Audio spatio-temporal fingerprints for cloudless real-time hands-free diarization on mobile devices

摘要

著录项

相似文献

相关主题

期刊订阅