首页> 外文会议>Machine Learning and Applications and Workshops (ICMLA), 2011 10th International Conference on >Non-Linear Semantic Embedding for Organizing Large Instrument Sample Libraries
【24h】

Non-Linear Semantic Embedding for Organizing Large Instrument Sample Libraries

机译:用于组织大型仪器样本库的非线性语义嵌入

获取原文

摘要

Though tags and metadata may provide rich indicators of relationships between high-level concepts like songs, artists or even genres, verbal descriptors lack the fine-grained detail necessary to capture acoustic nuances necessary for efficient retrieval of sounds in extremely large sample libraries. To these ends, we present a flexible approach titled Non-linear Semantic Embedding (NLSE), capable of projecting high-dimensional time-frequency representations of musical instrument samples into a low-dimensional, semantically-organized metric space. As opposed to other dimensionality reduction techniques, NLSE incorporates extrinsic semantic information in learning a projection, automatically learns salient acoustic features, and generates an intuitively meaningful output space.
机译:尽管标签和元数据可以提供有关歌曲,艺术家甚至流派等高级概念之间关系的丰富指标,但言语描述符缺乏捕获极其细微的细节,而这些细节对于捕获巨大样本库中有效检索声音所必需的声学细微差别是必不可少的。为此,我们提出了一种名为非线性语义嵌入(NLSE)的灵活方法,该方法能够将乐器样本的高维时频表示投影到低维,语义组织的度量空间中。与其他降维技术相反,NLSE在学习投影时结合了外部语义信息,自动学习了显着的声学特征,并生成了直观上有意义的输出空间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号