Published in: The Journal of the Acoustical Society of America

An image processing based paradigm for the extraction of tonal sounds in cetacean communications

Abstract

Dolphins and whales use tonal whistles for communication, and it is known that frequency modulation encodes contextual information. An automated mathematical algorithm could characterize the frequency modulation of tonal calls for use with clustering and classification. Most automatic cetacean whistle processing techniques are based on peak or edge detection or require analyst assistance in verifying detections. An alternative paradigm is introduced using techniques of image processing. Frequency information is extracted as ridges in whistle spectrograms. Spectral ridges are the fundamental structure of tonal vocalizations, and ridge detection is a well-established image processing technique, easily applied to vocalization spectrograms. This paradigm is implemented as freely available MATLAB scripts, coined IPRiT (image processing ridge tracker). Its fidelity in the reconstruction of synthesized whistles is compared to another published whistle detection software package, silbido. Both algorithms are also applied to real-world recordings of bottlenose dolphin (Tursiops truncatus) signature whistles and tested for the ability to identify whistles belonging to different individuals. IPRiT gave higher fidelity and lower false detection than silbido with synthesized whistles, and reconstructed dolphin identity groups from signature whistles, whereas silbido could not. IPRiT appears to be superior to silbido for the extraction of the precise frequency variation of the whistle.
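
To make the idea of "frequency information extracted as ridges in whistle spectrograms" concrete, here is a minimal Python sketch of generic Hessian-based ridge detection applied to a synthesized upsweep. It is not the IPRiT code (which the abstract describes as MATLAB scripts) and makes no claim about IPRiT's actual steps or parameters. The function names, the test signal, the FFT settings, and the threshold are all assumptions chosen for illustration.

```python
import numpy as np

def synth_whistle(fs=48000, dur=0.5, f0=8000.0, f1=14000.0, noise=0.1, seed=0):
    """Hypothetical test signal: a linear upsweep from f0 to f1 plus white noise."""
    t = np.arange(int(fs * dur)) / fs
    # Phase is the integral of the instantaneous frequency f0 + (f1 - f0) * t / dur.
    phase = 2 * np.pi * (f0 * t + 0.5 * (f1 - f0) * t**2 / dur)
    rng = np.random.default_rng(seed)
    return np.sin(phase) + noise * rng.standard_normal(t.size)

def magnitude_spectrogram(x, fs, nfft=512, hop=256):
    """Hann-windowed STFT magnitude, shape (frequency bins, frames), scaled to max 1."""
    window = np.hanning(nfft)
    n_frames = 1 + (len(x) - nfft) // hop
    frames = np.stack([x[i * hop:i * hop + nfft] * window for i in range(n_frames)])
    S = np.abs(np.fft.rfft(frames, axis=1)).T
    freqs = np.fft.rfftfreq(nfft, d=1.0 / fs)
    times = (np.arange(n_frames) * hop + nfft / 2) / fs  # frame centre times
    return S / S.max(), freqs, times

def ridge_strength(S):
    """Treat the spectrogram as an image and return -lambda_min of its Hessian.

    On a ridge crest the surface curves sharply downward across the ridge,
    so the smaller Hessian eigenvalue is strongly negative there.
    """
    d_f, d_t = np.gradient(S)        # axis 0 = frequency, axis 1 = time
    d_ff, d_ft = np.gradient(d_f)
    _, d_tt = np.gradient(d_t)
    half_trace = 0.5 * (d_ff + d_tt)
    root = np.sqrt(0.25 * (d_ff - d_tt) ** 2 + d_ft ** 2)
    lam_min = half_trace - root      # smaller eigenvalue of the 2x2 Hessian
    return np.maximum(-lam_min, 0.0)

def track_ridge(S, freqs, rel_threshold=0.15):
    """Per frame, take the frequency of the strongest ridge pixel (NaN if too weak)."""
    strength = ridge_strength(S)
    best_bin = strength.argmax(axis=0)
    best_val = strength.max(axis=0)
    keep = best_val >= rel_threshold * strength.max()  # assumed, untuned threshold
    return np.where(keep, freqs[best_bin], np.nan)

fs = 48000
x = synth_whistle(fs=fs)
S, freqs, times = magnitude_spectrogram(x, fs)
track = track_ridge(S, freqs)
true_freq = 8000.0 + (14000.0 - 8000.0) * times / 0.5  # known sweep, for comparison
print("median abs error (Hz):", np.nanmedian(np.abs(track - true_freq)))
```

The sketch follows only a single strongest ridge per frame. A practical tracker would also need to link ridge pixels into contours, handle overlapping whistles and harmonics, and reject impulsive clicks, none of which is attempted here.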