首页> 外文学位 >A clustering and principal component approach to exemplar based machine learning for classification identification.

【24h】

A clustering and principal component approach to exemplar based machine learning for classification identification.

机译：一种基于样本的机器学习的聚类和主成分方法，用于分类识别。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Classifying detections is an important field of study in many disciplines. Typically, data can be represented in the form of a multidimensional vector defined within some hyperspace (e.g. One may have the sepal length, sepal width, petal length and petal width of an iris flower). One can view many classification problems as processing an unknown data vector in some way that produces an output which correctly categorizes it (e.g. Is the iris flower Iris Setosa, Iris Versicolour or Iris Virginica)? Exemplar based machine learning techniques tackle these problems by learning from representative training data. Several popular algorithms employing these techniques in various ways have been developed and published in the literature. This study explores and develops an innovative exemplar based machine learning methodology which combines clustering techniques with the tools of principal components analysis (PCA) to tackle this problem. Through clustering the methodology segments each classification's arbitrary multidimensional complex shape of training data in a way which can be adequately generalized using the tools of PCA. This generalization is then applied toward the development of an exemplar based machine learning algorithm capable of classifying unknown data. The methodology was applied to twenty one real world data sets obtained from the University of California at Irvine data repository and the results were compared to those of other research methods. The overall accuracy results equaled or exceeded the absolute best of any other method found by the author for twelve out of the twenty one data sets tested.; The development of a measure of confidence for each classification declared for any given unknown is discussed. Concepts are then proposed which would allow one to decrease the amount of information presented to a user based on the confidence level that the classification was made correctly. This confidence based filtering offers the potential of further increasing the overall accuracy of the algorithm. To highlight, the results suggest that the proposed methodology has a high degree of real world applicability and could be used over a wide range of application domains yielding highly competitive accuracies.

机译：对检测进行分类是许多学科的重要研究领域。通常，数据可以以在某些超空间内定义的多维矢量的形式表示（例如，可以具有鸢尾花的萼片长度，萼片宽度，花瓣长度和花瓣宽度）。在以某种方式处理未知数据向量并产生正确分类的输出时，可以看到许多分类问题（例如鸢尾花鸢尾花Setosa，鸢尾花Versicolour还是鸢尾花Virginica）？基于示例的机器学习技术通过从代表性训练数据中学习来解决这些问题。在文献中已经开发出了几种以各种方式采用这些技术的流行算法。这项研究探索并开发了一种创新的基于示例的机器学习方法，该方法将聚类技术与主成分分析（PCA）工具相结合来解决此问题。通过对方法分类进行聚类，可以使用PCA的工具对每个分类的训练数据的任意多维复杂形状进行适当地概括。然后，将这种概括应用于能够对未知数据进行分类的基于示例的机器学习算法。将该方法应用于从加州大学尔湾分校数据存储库获得的21个现实世界数据集，并将结果与其他研究方法进行了比较。总体准确性结果等于或超过作者对21个测试数据集中的12个方法所发现的任何其他方法的绝对最佳结果。讨论了针对任何给定未知数声明的每个分类的置信度度量的发展。然后提出了一些概念，这些概念将允许人们根据正确进行分类的置信度来减少提供给用户的信息量。这种基于置信度的过滤提供了进一步提高算法整体精度的潜力。值得一提的是，结果表明，所提出的方法在现实世界中具有高度适用性，可以在产生高度竞争准确性的广泛应用领域中使用。

著录项

作者
Cassella, Vincent A.;
展开▼
作者单位

The Catholic University of America.;

展开▼
授予单位 The Catholic University of America.;
学科 Engineering Electronics and Electrical.; Artificial Intelligence.
学位 Ph.D.
年度 2006
页码 125 p.
总页数 125
原文格式 PDF
正文语种 eng
中图分类无线电电子学、电信技术;人工智能理论;
关键词
入库时间 2022-08-17 11:39:31

相似文献

外文文献
中文文献
专利

1. PCA-ELM: A Robust and Pruned Extreme Learning Machine Approach Based on Principal Component Analysis [J] . A. Castano, F. Fernandez-Navarre, C. Hervas-Martinez Neural processing letters . 2013,第3期

机译：PCA-ELM：一种基于主成分分析的鲁棒修剪的极限学习机方法
2. Ex-situ porosity classification in metallic components by laser metal deposition: A machine learning-based approach [J] . Garcia-Moreno Angel-Ivan, Alvarado-Orozco Juan-Manuel, Ibarra-Medina Juansethi, Journal of Manufacturing Processes . 2021,第Feba期

机译：通过激光金属沉积在金属组件中的前沟孔隙度分类：基于机器学习的方法
3. Macro-classification of meteorites by portable energy dispersive X-ray fluorescence spectroscopy (pED-XRF), principal component analysis (PCA) and machine learning algorithms [J] . Talanta: The International Journal of Pure and Applied Analytical Chemistry . 2020,第期

机译：便携式能量分散X射线荧光光谱（PED-XRF），主成分分析（PCA）和机器学习算法的宏观分类
4. Classification of Foreign Language Mobile Learning Strategy Based on Principal Component Analysis and Support Vector Machine [C] . Shuai Hu, Yan Gu, Yingxin Cheng International Conference on Information Technology and Intelligent Transportation Systems . 2017

机译：基于主成分分析和支持向量机的外语移动学习策略分类
5. Shape Theoretic and Machine Learning Based Methods for Automatic Clustering and Classification of Cardiomyocytes Based on Action Potential Morphology [D] . Gorospe, Giann 2018

机译：基于形状理论和机器学习的基于动作电位形态学的心肌细胞自动聚类和分类方法
6. Path Loss Prediction Based on Machine Learning Techniques: Principal Component Analysis Artificial Neural Network and Gaussian Process [O] . Han-Shin Jo, Chanshin Park, Eunhyoung Lee, 2020

机译：基于机器学习技术的路径损耗预测：主成分分析人工神经网络和高斯过程
7. The Comparison of Density-Based Clustering Approach among Different Machine Learning Models on Paddy Rice Image Classification of Multispectral and Hyperspectral Image Data [O] . Shiuan Wan, Yi-Ping Wang 2020

机译：多光谱和高光谱图像数据水稻图像分类不同机器学习模型中基于密度的聚类方法的比较

A clustering and principal component approach to exemplar based machine learning for classification identification.

摘要

著录项

相似文献

相关主题

期刊订阅