Automatic recognition of harmonic bird sounds.

机译：自动识别鸟的谐音。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

This dissertation analyzes the vocalizations of several common bird species: herring gull, bluejay, American crow, and Canada goose. It qualitatively analyzes the spectrograms of vocalizations of these species and then quantitatively analyzes the vocalizations using frequency track analysis. A frequency track is the path travelled by a peak in the DFT spectrum of a segment of a sound file as the segment is shifted forward in time. There are two parts to any pattern recognition system: (1) producing statistical models for each pattern to be recognized; and (2) using the models to find the patterns in real test data. The training procedure for this research requires hand selection of the frequency tracks corresponding to each training vocalization in the training data set using the frequency track analysis capabilities of Gtkvis. Following the formation of the frequency track files for each vocalization instance, a statistical model of the vocalization is created using the MakeModel() function. The recognition algorithm extracts sets of frequency tracks that closely approximate harmonic sounds in the sound file being processed. The set extraction function GetSets() uses a preliminary set extracting function called FindFeasibleSets() followed by a function that refines the "feasible sets" called FindMaximalSubsets(). Each extracted set in its final form is then compared with the statistical models generated during the training phase. If it matches one of the models closely, the recognizer declares the set is an occurrence of the corresponding vocalization.; The final result is a hardware and software implementation of a complete sound recognition system based on a methodology easily adapted to a wide class of vocalizations. One set of hardware consists of a handheld digital recorder, microphone, and pre-amp. The other set of hardware (in development) consists of an array of microphones coupled to a steerable parabolic dish microphone. The software consists of a sound visualization and processing application called Gtkvis and some command line tools for training and recognition that could easily be integrated into the Gtkvis GUI.

机译：本文分析了几种常见鸟类的发声：鲱鸥，蓝鸟，美洲乌鸦和加拿大鹅。它定性地分析了这些物种发声的声谱图，然后使用频率跟踪分析定量分析了发声。频率轨道是声音文件的某个片段的DFT频谱中的一个峰值随该片段在时间上向前移动而经过的路径。任何模式识别系统都有两个部分：（1）为要识别的每个模式生成统计模型；（2）使用模型在真实测试数据中找到模式。此研究的训练过程需要使用Gtkvis的频率跟踪分析功能，手动选择与训练数据集中每个训练发声对应的频率跟踪。在为每个发声实例形成频率跟踪文件之后，使用MakeModel（）函数创建发声的统计模型。识别算法提取频率轨道集，这些频率轨道集非常接近正在处理的声音文件中的谐波声音。集合提取函数GetSets（）使用一个称为FindFeasibleSets（）的初步集合提取函数，然后使用一个细化名为FindMaximalSubsets（）的“可行集合”的函数。然后将每个最终形式的提取集与训练阶段生成的统计模型进行比较。如果它与这些模型之一紧密匹配，则识别器声明该集合是相应发声的出现。最终结果是基于易于适应各种发声方法的方法，构成了完整的声音识别系统的硬件和软件。一组硬件包括一个手持数字录音机，麦克风和前置放大器。另一组硬件（正在开发中）由与可控抛物面碟形麦克风耦合的麦克风阵列组成。该软件由一个名为Gtkvis的声音可视化和处理应用程序以及一些用于培训和识别的命令行工具组成，这些工具可以轻松集成到Gtkvis GUI中。

著录项

作者
Heller, Jason Robert.;
展开▼
作者单位

State University of New York at Stony Brook.;

展开▼
授予单位 State University of New York at Stony Brook.;
学科 Mathematics.
学位 Ph.D.
年度 2005
页码 89 p.
总页数 89
原文格式 PDF
正文语种 eng
中图分类数学;
关键词

相似文献

外文文献
中文文献
专利

1. Automatic recognition of harmonic bird sounds using a frequency track extraction algorithm [J] . Heller JR, Pinezich JD The Journal of the Acoustical Society of America . 2008,第3期

机译：使用频率轨迹提取算法自动识别鸟的谐音
2. Automatic recognition of harmonic bird soundsusing a frequency track extraction algorithm [J] . Jason R. Heller, John D. Pinezich The Journal of the Acoustical Society of America . 2008,第3aPta1期

机译：利用频率轨迹提取算法自动识别鸟的谐音
3. Sound-spectrogram based automatic bird species recognition using MLP classifier [J] . Pahuja Roop, Kumar Avijeet Applied Acoustics . 2021,第Sepa期

机译：基于Sound-谱图的自动鸟类使用MLP分类器识别
4. Automatic Bird Species Recognition Based on Spiking Neural Network [C] . Ricky Mohanty, Bandi Kumar Mallik, Sandeep Singh Solanki International Conference on Nanoelectronics, circuits and communication systems . 2018

机译：基于尖峰神经网络的鸟类自动识别
5. A multimodal fusion approach for automatic postal address recognition system using Optical Character Recognition (OCR) and Automatic Speech Recognition (ASR) techniques. [D] . Singh, Amriteshwar. 2011

机译：一种使用光学字符识别（OCR）和自动语音识别（ASR）技术的自动邮政地址识别系统的多模式融合方法。
6. Automatic counting of birds in a bird deterrence field trial [O] . Elizabeth S. Simons, Mark K. Hinders 2019

机译：在鸟类威慑野外试验中自动计数鸟类
7. The Segregation Of Frequency-Modulated Concurrent Harmonic Sounds. [O] . Alain de Cheveigne, Cecile Marin, Alain De Cheveigné (cnrs/université Paris, 1996

机译：调频并发谐波的分离。

Automatic recognition of harmonic bird sounds.

摘要

著录项

相似文献

相关主题

期刊订阅