首页> 外文学位 >Automatic recognition of harmonic bird sounds.
【24h】

Automatic recognition of harmonic bird sounds.

机译:自动识别鸟的谐音。

获取原文
获取原文并翻译 | 示例

摘要

This dissertation analyzes the vocalizations of several common bird species: herring gull, bluejay, American crow, and Canada goose. It qualitatively analyzes the spectrograms of vocalizations of these species and then quantitatively analyzes the vocalizations using frequency track analysis. A frequency track is the path travelled by a peak in the DFT spectrum of a segment of a sound file as the segment is shifted forward in time. There are two parts to any pattern recognition system: (1) producing statistical models for each pattern to be recognized; and (2) using the models to find the patterns in real test data. The training procedure for this research requires hand selection of the frequency tracks corresponding to each training vocalization in the training data set using the frequency track analysis capabilities of Gtkvis. Following the formation of the frequency track files for each vocalization instance, a statistical model of the vocalization is created using the MakeModel() function. The recognition algorithm extracts sets of frequency tracks that closely approximate harmonic sounds in the sound file being processed. The set extraction function GetSets() uses a preliminary set extracting function called FindFeasibleSets() followed by a function that refines the "feasible sets" called FindMaximalSubsets(). Each extracted set in its final form is then compared with the statistical models generated during the training phase. If it matches one of the models closely, the recognizer declares the set is an occurrence of the corresponding vocalization.; The final result is a hardware and software implementation of a complete sound recognition system based on a methodology easily adapted to a wide class of vocalizations. One set of hardware consists of a handheld digital recorder, microphone, and pre-amp. The other set of hardware (in development) consists of an array of microphones coupled to a steerable parabolic dish microphone. The software consists of a sound visualization and processing application called Gtkvis and some command line tools for training and recognition that could easily be integrated into the Gtkvis GUI.
机译:本文分析了几种常见鸟类的发声:鲱鸥,蓝鸟,美洲乌鸦和加拿大鹅。它定性地分析了这些物种发声的声谱图,然后使用频率跟踪分析定量分析了发声。频率轨道是声音文件的某个片段的DFT频谱中的一个峰值随该片段在时间上向前移动而经过的路径。任何模式识别系统都有两个部分:(1)为要识别的每个模式生成统计模型; (2)使用模型在真实测试数据中找到模式。此研究的训练过程需要使用Gtkvis的频率跟踪分析功能,手动选择与训练数据集中每个训练发声对应的频率跟踪。在为每个发声实例形成频率跟踪文件之后,使用MakeModel()函数创建发声的统计模型。识别算法提取频率轨道集,这些频率轨道集非常接近正在处理的声音文件中的谐波声音。集合提取函数GetSets()使用一个称为FindFeasibleSets()的初步集合提取函数,然后使用一个细化名为FindMaximalSubsets()的“可行集合”的函数。然后将每个最终形式的提取集与训练阶段生成的统计模型进行比较。如果它与这些模型之一紧密匹配,则识别器声明该集合是相应发声的出现。最终结果是基于易于适应各种发声方法的方法,构成了完整的声音识别系统的硬件和软件。一组硬件包括一个手持数字录音机,麦克风和前置放大器。另一组硬件(正在开发中)由与可控抛物面碟形麦克风耦合的麦克风阵列组成。该软件由一个名为Gtkvis的声音可视化和处理应用程序以及一些用于培训和识别的命令行工具组成,这些工具可以轻松集成到Gtkvis GUI中。

著录项

  • 作者

    Heller, Jason Robert.;

  • 作者单位

    State University of New York at Stony Brook.;

  • 授予单位 State University of New York at Stony Brook.;
  • 学科 Mathematics.
  • 学位 Ph.D.
  • 年度 2005
  • 页码 89 p.
  • 总页数 89
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 数学;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号