Reintroducing KAPD as a Dataset for Machine Learning and Data Mining Applications

机译：重新引入KAPD作为计算机学习和数据挖掘应用的数据集

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

KACST Arabic Phonetic Database (KAPD) has been in use by researchers for around fifteen years since its initial release. Researches in acoustics and phonetics have benefited from its phonetically rich content. In fact, KAPD has the potential to go further steps with the research community. In this work, KAPD is subject to enhancements and improvements in order to serve as dataset for machine learning and data mining application. This work involves refining and reviewing the already existing metadata of KAPD and adding new material that are necessary for machine learning and data mining applications. The updated phoneme statistics after the corpus upgrade are presented from different perspectives. Data format and time units are made compatible with those of HTK. The paper discusses the potential of KAPD to serve as either a balanced or an imbalanced dataset.

机译：KACST阿拉伯语音数据库（KAPD）已被研究人员使用，自最初发布以来大约十五年。声学和语音学的研究从其语音富含含量中受益。事实上，KAPD有可能与研究界进行进一步的步骤。在这项工作中，KAPD旨在提高和改进，以便成为机器学习和数据挖掘应用的数据集。这项工作涉及炼油和审查已现有的KAPD元数据并添加机器学习和数据挖掘应用所需的新材料。语料库升级后更新的音素统计信息从不同的角度出现。数据格式和时间单位与HTK的数据格式和时间单位兼容。本文讨论了KAPD作为平衡或不平衡数据集的潜力。

著录项

来源
《UKSim-AMSS European Modelling Symposium on Computer Modelling and Simulation》|2016年|1 v.|共5页
会议地点
作者
Yasser Seddiq; Ali Meftah; Mansour Alghamdi; Yousef Alotaibi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类自动模拟理论（自动仿真理论）;
关键词
Europe;

机译：欧洲;

相似文献

外文文献
中文文献
专利

1. A data-attribute-space-oriented double parallel (DASODP) structure for enhancing extreme learning machine: Applications to regression datasets [J] . Yan-Lin He, Zhi-Qiang Geng, Qun-Xiong Zhu Engineering Applications of Artificial Intelligence . 2015,第may期

机译：面向数据属性空间的双并行（DASODP）结构，用于增强极限学习机：回归数据集的应用
2. Recent developments in human gait research: parameters, approaches, applications, machine learning techniques, datasets and challenges [J] . Prakash Chandra, Kumar Rajesh, Mittal Namita Artificial Intelligence Review: An International Science and Engineering Journal . 2018,第1期

机译：人体步态研究的最新进展：参数，方法，应用，机器学习技术，数据集和挑战
3. Accurate computation: COVID-19 rRT-PCR positive test dataset using stages classification through textual big data mining with machine learning [J] . Ramanathan Shalini, Ramasundaram Mohan Journal of supercomputing . 2021,第7期

机译：准确计算：Covid-19 RRT-PCR正测试数据集通过通过机器学习的文本大数据挖掘使用阶段分类
4. Reintroducing KAPD as a Dataset for Machine Learning and Data Mining Applications [C] . Yasser Seddiq, Ali Meftah, Mansour Alghamdi, UKSim-AMSS European Modelling Symposium on Computer Modelling and Simulation . 2016

机译：重新引入KAPD作为机器学习和数据挖掘应用程序的数据集
5. Multidisciplinary Applications of U.S. Soil Datasets: Machine Learning Models, Data Mining, and Land Use Analyses. [D] . Amanda M. Ramcharan. 2017

机译：美国土壤数据集的多学科应用：机器学习模型，数据挖掘和土地利用分析。
6. Quantification of histone modification ChIP-seq enrichment for data mining and machine learning applications [O] . Stephen A Hoang, Xiaojiang Xu, Stefan Bekiranov 2011

机译：用于数据挖掘和机器学习应用的组蛋白修饰ChIP-seq富集的量化
7. Optimization Techniques for Mining Power Quality Data and Processing Unbalanced Datasets in Machine Learning Applications [O] . Alvaro Furlani Bastos, Surya Santoso 2021

机译：用于挖掘电力质量数据的优化技术和机器学习应用中的不平衡数据集

Reintroducing KAPD as a Dataset for Machine Learning and Data Mining Applications

摘要

著录项

相似文献

相关主题

期刊订阅