Classification of Astronomical Objects in the Galaxy M81 using Machine Learning Techniques II. An Application of Clustering in Data Pre-processing

Abstract

Identifying objects of a particular class in current astronomical data is challenging. In this study, we explored methods to identify globular cluster candidates from a pool of astronomical objects in the galaxy M81. First, we developed a method to cross-match the data automatically. In the previous study, this step was done by manually overlaying the imaging data. The process also eliminates data points that appear in only one or two filters, which indicates that they are artifacts. Next, we used the Expectation Maximization (EM) clustering technique to label the training dataset with classes and to reduce the manual effort needed in preprocessing. Our results show that the data can be clustered into 12 clusters, which can be grouped into 6 groups of astronomical objects with similar morphological structures. When these 6 groups of data were used to build classification models, the prediction accuracies improved significantly: from 79.9% to 90.57% for Random Forest and from 67.1% to 91.59% for Multilayer Perceptron. Moreover, when the model built from those data was used to analyze the unseen dataset, the results show that the model can categorize the objects into classes with characteristics close to those recognized in astronomy. However, this model still cannot fully separate globular clusters from foreground stars and background galaxies because of the similarities in their photometric properties.
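
The cross-matching step described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the filter names (band_a, band_b, band_c), the pixel coordinates, the matching tolerance, and the greedy nearest-neighbour strategy are all assumptions made for the example. Only the rule of discarding detections seen in one or two filters comes from the abstract.

```python
from math import hypot


def cross_match(catalogs, tol=1.0, min_filters=3):
    """Group detections from several filters by position.

    catalogs: dict mapping a filter name to a list of (x, y) positions.
    tol: maximum separation (same units as x, y) for two detections
         to count as the same object (hypothetical value).
    min_filters: objects seen in fewer filters are treated as artifacts.
    Returns a list of dicts {filter name: (x, y)}, one per kept object.
    """
    objects = []  # each entry: {"pos": (x, y), "det": {filter: (x, y)}}
    for band, detections in catalogs.items():
        for x, y in detections:
            # Look for the nearest known object not yet seen in this band.
            best, best_d = None, tol
            for obj in objects:
                if band in obj["det"]:
                    continue
                d = hypot(x - obj["pos"][0], y - obj["pos"][1])
                if d <= best_d:
                    best, best_d = obj, d
            if best is None:
                # No match within tolerance: start a new object.
                objects.append({"pos": (x, y), "det": {band: (x, y)}})
            else:
                best["det"][band] = (x, y)
    # Keep only objects detected in enough filters.
    return [o["det"] for o in objects if len(o["det"]) >= min_filters]


# Toy catalogs (made-up positions): two real objects plus two
# single-filter detections that should be discarded as artifacts.
catalogs = {
    "band_a": [(10.0, 10.0), (50.0, 50.0), (80.0, 20.0)],
    "band_b": [(10.2, 9.9), (50.1, 50.3)],
    "band_c": [(9.8, 10.1), (49.9, 49.8), (30.0, 70.0)],
}
matched = cross_match(catalogs)
print(len(matched))  # 2
```

The detections at (80.0, 20.0) and (30.0, 70.0) each appear in a single filter, so they are dropped. A real catalog would more likely be matched on sky coordinates with a spatial index rather than this quadratic loop.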

Bibliographic details

Source
International Joint Conference on Computer Science and Software Engineering, 2021, pp. 1-6 (6 pages)
Authors
Tapanapong Chuntama; Chutipong Suwannajak; Prapaporn Techa-Angkoon; Benjamas Panyangam; Nahathai Tanakul
Original format: PDF
Keywords
Training; Analytical models; Predictive models; Multilayer perceptrons; Data models; Object recognition; Astronomy

