“Are You Playing a Shooter Again?!” Deep Representation Learning for Audio-Based Video Game Genre Recognition

Amiriparian Shahin; Cummins Nicholas; Gerczuk Maurice; Pugachevskiy Sergey; Ottl Sandra; Schuller Bjorn

首页> 外文期刊>IEEE Transactions on Games >“Are You Playing a Shooter Again?!” Deep Representation Learning for Audio-Based Video Game Genre Recognition

【24h】

“Are You Playing a Shooter Again?!” Deep Representation Learning for Audio-Based Video Game Genre Recognition

机译：“你再次拍打射手吗？！”基于音频视频游戏类型识别的深度代表学习

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we present a novel computer audition task: audio-based video game genre classification. The aim of this study is threefold: 1) to check the feasibility of the proposed task; 2) to introduce a new corpus: The Game Genre by Audio + Multimodal Extracts (G(2) AME), collected entirely from social multimedia; and 3) to compare the efficacy of various acoustic feature spaces to classify the G(2) AME corpus into six game genres using a linear support vector machine classifier. For the classification we extract three different feature representations from the game audio files: 1) Knowledge-based acoustic features; 2) Deep Spectrum features; and 3) quantized Deep Spectrum features using Bag-of-Audio-Words. The Deep Spectrum features are a deep-learning-based representation derived from forwarding the visual representations of the audio instances, in particular spectrograms, mel-spectrograms, chromagrams, and their deltas through deep task-independent pretrained CNNs. Specifically, activations of fully connected layers from three common image classification CNNs, GoogLeNet, AlexNet, and VGG16 are used as feature vectors. Results for the six-genre classification problem indicate the suitability of our deep learning approach for this task. Our best method achieves an accuracy of up to 66.9% unweighted average recall using tenfold cross-validation.

机译：在本文中，我们提出了一种新颖的计算机试听任务：基于音频的视频游戏类型分类。本研究的目的是三倍：1）检查拟议任务的可行性; 2）介绍一个新的语料库：音频+多模式提取物的游戏类型（G（2）ame），完全来自社交多媒体; 3）比较各种声学特征空间的功效将G（2）ame语料库分类为使用线性支持向量机分类器将G（2）ame语料库分为六个游戏。对于分类，我们从游戏音频文件中提取三个不同的特征表示：1）基于知识的声学功能; 2）深度谱特征; 3）使用音频字袋来量化的深度频谱特征。深度频谱特征是一种基于深度学习的表示，通过深度任务独立的预制CNNS转发音频实例的视觉表示，特别是频谱图，熔点，Chromagrams及其Δ。具体地，从三个公共图像分类CNNS，Googlenet，AlexNet和VGG16激活完全连接的层作为特征向量。六种分类问题的结果表明我们对此任务的深度学习方法的适用性。我们最好的方法使用十倍交叉验证实现了高达66.9％的未加权平均召回的准确性。

著录项

来源
《IEEE Transactions on Games》 |2020年第2期|145-154|共10页
作者
Amiriparian Shahin; Cummins Nicholas; Gerczuk Maurice; Pugachevskiy Sergey; Ottl Sandra; Schuller Bjorn;
展开▼
作者单位

Univ Augsburg Embedded Intelligence Hlth Care & Wellbeing D-86159 Augsburg Germany|Tech Univ Munich Machine Intelligence & Signal Proc Grp D-80333 Munich Germany;

Univ Augsburg Embedded Intelligence Hlth Care & Wellbeing D-86159 Augsburg Germany;

Univ Augsburg Embedded Intelligence Hlth Care & Wellbeing D-86159 Augsburg Germany;

Univ Augsburg Embedded Intelligence Hlth Care & Wellbeing D-86159 Augsburg Germany;

Univ Augsburg Embedded Intelligence Hlth Care & Wellbeing D-86159 Augsburg Germany;

Univ Augsburg Embedded Intelligence Hlth Care & Wellbeing D-86159 Augsburg Germany|Imperial Coll London GLAM Grp Language Audio & Mus London SW7 2AZ England;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Games; Feature extraction; Task analysis; Acoustics; Monitoring; YouTube; Sports; Audio classification; convolutional neural network (CNN); deep learning; game genre classification;

机译：游戏;特征提取;任务分析;声学;监测;YouTube;体育;音频分类;卷积神经网络（CNN）;深入学习;游戏类型分类;

相似文献

外文文献
中文文献
专利

1. Deep Learning for Video Game Playing [J] . Justesen Niels, Bontrager Philip, Togelius Julian, IEEE Transactions on Games . 2020,第1期

机译：深度学习视频游戏播放
2. Playing a FPS Doom Video Game with Deep Visual Reinforcement Learning [J] . Adil Khan, Feng Jiang, Shaohui Liu, Automatic Control and Computer Sciences . 2019,第3期

机译：使用深度视觉强化学习的FPS Doom视频游戏
3. Assaying neural activity of children during video game play in public spaces: a deep learning approach [J] . Ravindran Akshay Sujatha, Mobiny Aryan, Cruz-Garza Jesus G., Journal of neural engineering . 2019,第3期

机译：在公共场所进行视频游戏时分析儿童的神经活动：一种深度学习方法
4. Pororobot: A Deep Learning Robot that Plays Video QA Games [C] . Kyung-Min Kim, Chang-Jun Nan, Jung-Woo Ha, Association for the Advancement of Artificial Intelligence Symposium . 2015

机译：Pororobot：播放视频Q＆A游戏的深层学习机器人
5. Full spectrum propaganda: The United States military, video games, and the genre of the military-themed shooter. [D] . Clearwater, David A. 2006

机译：全谱宣传：美国军事，电子游戏以及以军事为主题的射击游戏的类型。
6. Just how expert are expert video-game players? Assessing the experience and expertise of video-game players across action video-game genres [O] . Andrew J. Latham, Lucy L. M. Patston, Lynette J. Tippett 2013

机译：专家视频游戏玩家到底有多专业？评估跨动作视频游戏类型的视频游戏玩家的经验和专业知识
7. Deep Reinforcement Learning Agent for Playing 2D Shooting Games [O] . Dongcheul Lee, Janise McNair 2018

机译：用于演奏2D射击游戏的深增强学习代理

“Are You Playing a Shooter Again?!” Deep Representation Learning for Audio-Based Video Game Genre Recognition

摘要

著录项

相似文献

相关主题

期刊订阅