首页> 外文学位 >Investigations in music similarity: Analysis, organization, and visualization using tonal features.
【24h】

Investigations in music similarity: Analysis, organization, and visualization using tonal features.

机译:音乐相似性调查:使用音调特征进行分析,组织和可视化。

获取原文
获取原文并翻译 | 示例

摘要

This dissertation is in the area of music information retrieval, which is an interdisciplinary science that incorporates knowledge and expertise from artificial intelligence, music theory, mathematical modeling, computational analysis, databases, music perception and music cognition. We are focused on developing computational ways to accurately assess, quantify, and visualize degrees of musical similarity. This involves the end-to-end development of computational tools, from the design of the mathematical models, to the implementation and testing of the algorithms on large datasets, to the creation of an intuitive and user-centered interface for communicating the results. This dissertation has two parts: music similarity assessment and music visualization.; Music similarity assessment is a complex problem; definitions of similarity can diverge widely and be highly subjective. Can we build computer models to recognize these different degrees of similarity? Our work addresses this question, and has focused on the development of similarity metrics based on tonal features, which are obtained from pitch and key information. We have developed four methods of similarity assessment, each using one of the following features: pitch class distributions, key sequences, key distributions, and mean-time-in-key distributions, and based on one of the following similarity metrics: L1 norm, L 2 norm, and sequence alignment.; We use the similarity assessment techniques to conduct two sets of experiments: the first uses different renditions of pieces, while the second uses theme and variation pieces. For each experiment, all four methods are used to compare the pieces in each data set one to another. Statistical analyses such as quantile-quantile plots and the Kolmogorov-Smirnov test confirm that comparison results from within similar and across dissimilar sets come from different underlying distributions for all the methods. A Mann-Whitney rank sum test confirms that results for similar and dissimilar pieces come from distributions with different medians for all the methods. We further compute Type I, Type II and Bayesian probabilities to analyze each method's performance.; While metrics are a quick and clear way to determine similarity, visualizations can add a richness and complexity to the analysis. Our goal is to present music information in a visual form that is intuitive and easy to access. One method of visualization we have developed is a dynamic visualization that displays the progression of the tonal content of a music piece on a two-dimensional representation of keys. The sequence of keys in a music piece is mapped onto a space that contains points representing all possible keys. The distribution of keys of a piece being visualized is indicated as growing colored discs, where the colors correspond to the keys detected, and the size of the discs to the key frequency. This visualization is an improvement over more basic charting methods, such as histograms, and it maintains standards of information design in the form of added dimensionality, color, and animation. We show that the visualization is invariant under music transformations that preserve the piece's identity.; We demonstrate the dynamic visualization system using two music genres. We consider classical and Armenian music. Classical music tends to follow a pattern of beginning in the key of the piece, traveling to neighboring keys throughout the course of the piece before returning to the key of the piece in the end. In contrast, Armenian music follows a more sequential pattern where the piece begins in a key, remains there for a period of time before moving on to other keys. It rarely ends in the key it first visited. We use the visualization method to illustrate these patterns for a set of classical and Armenian pieces.; Another method of visualization we have developed exploits the tonal properties of music to derive a hierarchical description for each piece that can then be
机译:本文涉及音乐信息检索领域,它是一门跨学科的科学,融合了人工智能,音乐理论,数学建模,计算分析,数据库,音乐感知和音乐认知等方面的知识和专长。我们专注于开发计算方法,以准确评估,量化和可视化音乐相似度。这涉及端到端的计算工具开发,从数学模型的设计到大型数据集上算法的实现和测试,再到创建直观且以用户为中心的界面来传达结果。本文分为两个部分:音乐相似性评估和音乐可视化。音乐相似性评估是一个复杂的问题;相似性的定义可能差异很大,而且主观性很高。我们可以建立计算机模型来识别这些不同程度的相似性吗?我们的工作解决了这个问题,并致力于基于音调特征的相似性度量的开发,这些特征是从音调和关键信息中获得的。我们已经开发了四种相似度评估方法,每种方法都使用以下功能之一:音高等级分布,键序列,键分布和平均键时分布,并且基于以下相似度指标之一:L1范数, L 2规范和序列比对。我们使用相似性评估技术进行两组实验:第一组使用不同的片段再现,第二组使用主题和变体片段。对于每个实验,所有四种方法都用于将每个数据集中的各个部分进行比较。诸如分位数分位数图和Kolmogorov-Smirnov检验之类的统计分析证实,对于所有方法,相似组之间和不同组之间的比较结果均来自不同的基础分布。 Mann-Whitney秩和检验证实,对于所有方法,相似和不相似件的结果均来自具有不同中位数的分布。我们进一步计算I型,II型和贝叶斯概率,以分析每种方法的性能。指标是确定相似性的快速而清晰的方法,而可视化可以为分析增加丰富性和复杂性。我们的目标是以直观且易于访问的视觉形式呈现音乐信息。我们开发的一种可视化方法是动态可视化,可以在按键的二维表示上显示音乐作品的音调内容的进度。音乐作品中的键序列被映射到一个包含代表所有可能键的点的空间。可视化的棋子的按键分布表示为不断增长的彩色光盘,其中颜色对应于检测到的按键,而光盘的大小对应于按键频率。这种可视化是对诸如直方图之类的更基本的制图方法的改进,并且它以增加的尺寸,颜色和动画的形式维护了信息设计的标准。我们表明,在音乐变换下,可视化是不变的,可以保持乐曲的身份。我们演示了使用两种音乐流派的动态可视化系统。我们考虑古典音乐和亚美尼亚音乐。古典音乐倾向于遵循从乐曲的键开始的模式,在乐曲的整个过程中传播到相邻的键,然后最终返回到乐曲的键。相比之下,亚美尼亚音乐遵循更连续的模式,即乐曲从一个键开始,在此键上停留了一段时间后再转到其他键。它很少以首次访问的密钥结尾。我们使用可视化方法来说明一组古典和亚美尼亚作品的这些模式。我们开发的另一种可视化方法是利用音乐的音调属性来为每个片段导出层次描述,然后

著录项

  • 作者

    Mardirossian, Arpi.;

  • 作者单位

    University of Southern California.$bIndustrial and Systems Engineering: Doctor of Philosophy.;

  • 授予单位 University of Southern California.$bIndustrial and Systems Engineering: Doctor of Philosophy.;
  • 学科 Music.; Artificial Intelligence.
  • 学位 Ph.D.
  • 年度 2007
  • 页码 141 p.
  • 总页数 141
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 音乐;人工智能理论;
  • 关键词

  • 入库时间 2022-08-17 11:39:14

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号