首页> 外文学位 >Sky Mining - Application to Photomorphic Redshift Estimation.
【24h】

Sky Mining - Application to Photomorphic Redshift Estimation.

机译:天空采矿-在变态红移估计中的应用。

获取原文
获取原文并翻译 | 示例

摘要

The field of astronomy has evolved from the ancient craft of observing the sky. In it's present form, astronomers explore the cosmos not just by observing through the tiny visible window used by our eyes, but also by exploiting the electromagnetic spectrum from radio waves to gamma rays. The domain is undoubtedly at the forefront of data-driven science. The data growth rate is expected to be around 50%--100% per year. This data explosion is attributed largely to the large-scale wide and deep surveys of the different regions of the sky at multiple wavelengths (both ground and space-based surveys).;This dissertation describes the application of machine learning methods to the estimation of galaxy redshifts leveraging such a survey data. Galaxy is a large system of stars held together by mutual gravitation and isolated from similar systems by vast regions of space. Our view of the universe is closely tied to our understanding of galaxy formation. Thus, a better understanding of the relative location of the multitudes of galaxies is crucial. The position of each galaxy can be characterized using three coordinates. Right Ascension (ra) and Declination (dec) are the two coordinates that locate the galaxy in two dimensions on the plane of the sky. It is relatively straightforward to measure them. In contrast, fixing the third coordinate that is the galaxy's distance from the observer along the line of sight (redshift 'z') is considerably more challenging.;"Spectroscopic redshift" method gives us accurate and precise measurements of z. However, it is extremely time-intensive and unusable for faint objects. Additionally, the rate at which objects are being identified via photometric surveys far exceeds the rate at which the spectroscopic redshift measurements can keep pace in determining their distance. As the surveys go deeper into the sky, the proportion of faint objects being identified also continues to increase. In order to tackle both these drawbacks increasing in severity every day, alternative method "Photometric redshift" has been studied in the past. It uses the brightness of the object viewed through various standard filters, each of which lets through a relatively broad spectrum of colors. However, these methods are bound by the degeneracy problem (objects with different color profiles have the same redshift) which leads to low predictive accuracy.;As part of our study, we are looking beyond color attributes to identify other measured attributes as degeneracy resolvers as well as generate estimators that are highly accurate; termed as "Photomorphic redshift" estimators. The present study investigates the photometric information of the objects such as color and magnitude (= observed flux) and morphology attributes such as shape, size, orientation and concentration in the different wavelengths. The specific type of magnitude used in this study are the PSF, Fiber and Petrosian magnitude. The morphology attributes are the ratio of Fiber to Petrosian magnitude, concentration index and Petrosian radius. All these attributes are in the five bands ugriz of the Sloan Digital Sky Survey (SDSS).;Machine learning techniques based on Naive Bayes (NB), Bayesian Network (BN) and Generalized Linear Model (GLM) are researched to better understand their applicability, advantages and resulting predictive performance in terms of efficiency and accuracy. Note: The SDSS Data Release (DR) 10 data was used in the executed experiments (total of 700,777 galaxies with forty-five attributes associated with each galaxy).;The significant findings of the present work are as follows: 1. Magnitude and morphology attributes have been found to be successful degeneracy resolvers. 2. Magnitude and morphology attributes have been found to be better redshift estimators than color attributes alone. 3. Naive Bayes, Bayesian Network and GLM have been found to be viable redshift estimation methods. Attribute selection is an important factor in computational performance. 4. In addition to the redshift estimate, the likelihood distribution of the estimate is even more useful, and my Bayesian Network models provide that information. This is particularly useful in ensemble methods as well as the kernel for mass distribution in the universe. 5. The generated Bayesian Network models can be applied to any of the variables, not just limited to redshift. Example applications include quality analysis and missing value imputation. Different types of Bayesian Network learning algorithms---constraint-based, score-based and hybrid---were investigated in detail.
机译:天文学领域已经从古老的观测天空技术演变而来。以目前的形式,天文学家不仅通过观察我们眼睛所用的微小可见窗口来观察宇宙,而且还利用从无线电波到伽马射线的电磁光谱进行探索。毫无疑问,该领域处于数据驱动科学的最前沿。预计数据增长率为每年50%-100%。数据爆炸的主要原因是在多个波长下对天空的不同区域进行了大规模的广泛和深入的调查(地面调查和天基调查)。;本文介绍了机器学习方法在银河系估计中的应用利用此类调查数据进行红移。星系是由相互引力保持在一起的大型恒星系统,并且与广阔空间区域的类似系统隔离。我们对宇宙的看法与对星系形成的理解紧密相关。因此,更好地理解众多星系的相对位置至关重要。每个星系的位置可以使用三个坐标来表征。右升数(ra)和赤纬(dec)是在天空平面上二维定位星系的两个坐标。测量它们相对简单。相比之下,固定第三个坐标即银河系沿着视线到观察者的距离(红移“ z”)要困难得多。;“光谱红移”方法为我们提供了z的精确测量。但是,这非常耗时并且对于微弱的对象不可用。另外,通过光度学调查识别物体的速度远远超过光谱红移测量可以保持其确定距离的速度。随着勘测工作深入天空,被识别出的微弱物体的比例也继续增加。为了解决每天都在增加严重性的这两个缺点,过去已经研究了替代方法“光度红移”。它使用通过各种标准滤镜查看的对象的亮度,每个滤镜都可以通过相对较宽的颜色范围。但是,这些方法受到退化问题(具有不同颜色配置文件的对象具有相同的红移)的约束,这导致较低的预测精度。;作为我们研究的一部分,我们正在寻找超越颜色属性以将其他测量属性识别为退化分解器的问题。以及生成高度准确的估算器;称为“光变红移”估计器。本研究调查对象的光度信息,例如颜色和大小(=观察到的通量)以及形态属性,例如不同波长下的形状,大小,方向和浓度。本研究中使用的特定量级类型是PSF,纤维和Petrosian量级。形态属性是纤维与石油的比值,浓度指数和石油的半径。所有这些属性都在Sloan数字天空调查(SDSS)的五个波段ugriz中;研究了基于朴素贝叶斯(NB),贝叶斯网络(BN)和广义线性模型(GLM)的机器学习技术,以更好地了解它们的适用性效率和准确性方面的优势以及由此产生的预测性能。注意:在执行的实验中使用了SDSS Data Release(DR)10数据(总共700,777个星系,每个星系有45个属性)。本研究的重要发现如下:1.大小和形态已发现属性是成功的简并解析器。 2.已经发现,与单独的颜色属性相比,幅度和形态属性是更好的红移估计量。 3.已发现朴素贝叶斯,贝叶斯网络和GLM是可行的红移估计方法。属性选择是计算性能的重要因素。 4.除了红移估计之外,估计的似然分布甚至更有用,我的贝叶斯网络模型提供了该信息。这在合奏方法以及宇宙中质量分布的内核中特别有用。 5.生成的贝叶斯网络模型可以应用于任何变量,而不仅限于红移。示例应用程序包括质量分析和缺失值估算。详细研究了不同类型的贝叶斯网络学习算法-基于约束,基于分数和混合的。

著录项

  • 作者

    Nayak, Pragyansmita.;

  • 作者单位

    George Mason University.;

  • 授予单位 George Mason University.;
  • 学科 Computer science.;Astronomy.;Statistics.
  • 学位 Ph.D.
  • 年度 2015
  • 页码 229 p.
  • 总页数 229
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号