首页> 外文会议> >SpeakerSense: Energy Efficient Unobtrusive Speaker Identification on Mobile Phones
【24h】

SpeakerSense: Energy Efficient Unobtrusive Speaker Identification on Mobile Phones

机译:SpeakerSense:移动电话上的高能效扬声器识别

获取原文
获取原文并翻译 | 示例

摘要

Automatically identifying the person you are talking with using continuous audio sensing has the potential to enable many pervasive computing applications from memory assistance to annotating life logging data. However, a number of challenges, including energy efficiency and training data acquisition, must be addressed before unobtrusive audio sensing is practical on mobile devices. We built SpeakerSense, a speaker identification prototype that uses a heterogeneous multi-processor hardware architecture that splits computation between a low power processor and the phone's application processor to enable continuous background sensing with minimal power requirements. Using SpeakerSense, we benchmarked several system parameters (sampling rate, GMM complexity, smoothing window size, and amount of training data needed) to identify thresholds that balance computation cost with performance. We also investigated channel compensation methods that make it feasible to acquire training data from phone calls and an automatic segmentation method for training speaker models based on one-to-one conversations.
机译:使用连续音频感应自动识别正在与您交谈的人的潜力有可能使许多普及的计算应用程序从存储协助到注释生活记录数据成为可能。但是,在移动设备上进行无干扰的音频传感之前,必须解决许多挑战,包括能效和培训数据获取。我们构建了SpeakerSense,这是一种说话人识别原型,它使用异构的多处理器硬件体系结构在低功耗处理器和电话应用处理器之间分配计算能力,从而以最小的功耗要求实现连续的背景感应。使用SpeakerSense,我们对多个系统参数(采样率,GMM复杂度,平滑窗口大小和所需的训练数据量)进行了基准测试,以识别可在计算成本与性能之间取得平衡的阈值。我们还研究了使从电话中获取训练数据变得可行的信道补偿方法,以及基于一对一对话来训练说话人模型的自动分割方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号