首页> 外文会议>IEEE/RSJ International Conference on Intelligent Robots and Systems >Semi-automatic bird song analysis by spatial-cue-based integration of sound source detection, localization, separation, and identification
【24h】

Semi-automatic bird song analysis by spatial-cue-based integration of sound source detection, localization, separation, and identification

机译:通过基于空间提示的声源检测,定位,分离和识别集成进行半自动鸟鸣分析

获取原文

摘要

This paper addresses bird song analysis based on semi-automatic annotation. Research in animal behavior, especially with birds, would be aided by automated (or semiautomated) systems that can localize sounds, measure their timing, and identify their source. This is difficult to achieve in real environments where several birds may be singing from different locations and at the same time. Analysis of recordings from the wild has in the past typically required manual annotation. Such annotation is not always accurate or even consistent, as it may vary both within or between observers. Here we propose a system that uses automated methods from robot audition, including sound source detection, localization, separation and identification. In robot audition these technologies have typically been studied separately; combining them often leads to poor performance in real-time application from the wild. We suggest that integration is aided by placing a primary focus on spatial cues, then combining other features within a Bayesian framework. A second problem has been that supervised machine learning methods typically requires a pre-trained model that may require a large training set of annotated labels. We have employed a semi-automatic annotation approach that requires much less pre-annotation. Preliminary experiments with recordings of bird songs from the wild revealed that for identification accuracy our system outperformed a method based on conventional robot audition.
机译:本文研究了基于半自动注释的鸟歌分析。自动化(或半自动化)系统可以辅助动物的行为研究,尤其是鸟类的行为,该系统可以对声音进行定位,测量其时机并确定其来源。在实际环境中很难做到这一点,在这种环境中,可能有几只鸟同时从不同的位置唱歌。过去,对野外录音的分析通常需要手动注释。由于注释在观察者内部或观察者之间可能会有所不同,因此有时并不总是准确甚至连贯的。在这里,我们提出了一种系统,该系统使用了来自机器人试听的自动化方法,包括声源检测,定位,分离和识别。在机器人试听中,通常会对这些技术进行单独研究。结合使用它们通常会导致野外实时应用程序的性能下降。我们建议通过将主要重点放在空间线索上,然后在贝叶斯框架内组合其他功能来辅助集成。第二个问题是,有监督的机器学习方法通​​常需要预先训练的模型,而该模型可能需要带注释标签的大量训练集。我们采用了一种半自动注释方法,该方法所需的预注释要少得多。记录野外鸟类鸣叫的初步实验表明,在识别准确性方面,我们的系统优于基于常规机器人试听的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号