首页> 外文期刊>International journal of speech technology >Large scale data based audio scene classification
【24h】

Large scale data based audio scene classification

机译:基于大规模数据的音频场景分类

获取原文
获取原文并翻译 | 示例
           

摘要

Artificial Intelligence and Machine learning has been used by many research groups for processing large scale data known as big data. Machine learning techniques to handle large scale complex datasets are expensive to process computation. Apache Spark framework called spark MLlib is becoming a popular platform for handling big data analysis and it is used for many machine learning problems such as classification, regression and clustering. In this work, Apache Spark and the advanced machine learning architecture of a Deep Multilayer Perceptron (MLP), is proposed for Audio Scene Classification. Log Mel band features are used to represent the characteristics of the input audio scenes. The parameters of the DNN are set according to the DNN baseline of DCASE 2017 challenge. The system is evaluated with TUT dataset (2017) and the result is compared with the baseline provided.
机译:许多研究小组已使用人工智能和机器学习来处理称为大数据的大规模数据。用于处理大规模复杂数据集的机器学习技术对于处理计算而言非常昂贵。 Apache Spark框架称为spark MLlib正在成为处理大数据分析的流行平台,并且用于许多机器学习问题,例如分类,回归和聚类。在这项工作中,提出了Apache Spark和深度多层感知器(MLP)的高级机器学习架构,用于音频场景分类。 Log Mel波段特征用于表示输入音频场景的特征。 DNN的参数是根据DCASE 2017挑战的DNN基线设置的。使用TUT数据集(2017)对系统进行评估,并将结果与​​提供的基线进行比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号