【24h】

A Comparison of Unsupervised Classifiers on BATSE Catalog Data

机译:BATSE目录数据上无监督分类器的比较

获取原文
获取原文并翻译 | 示例

摘要

We classify BATSE gamma-ray bursts using unsupervised clustering algorithms in order to compare classification with statistical clustering techniques. BATSE bursts detected with homogeneous trigger criteria and measured with a limited attribute set (duration, hardness, and fluence) are classified using four unsupervised algorithms (the concept hierarchy classifier ESX, the EM algorithm, the Kmeans algorithm, and a kohonen neural network). The classifiers prefer three-class solutions to two-class and four-class solutions. When forced to find two classes, the classifiers do not find the traditional long and short classes; many short soft events are placed in a class with the short hard bursts. When three classes are found, the classifiers clearly identify the short bursts, but place far more members in an intermediate duration soft class than have been found using statistical clustering techniques. It appears that the boundary between short faint and long bright bursts is more important to the classifiers than is the boundary between short hard and long soft bursts. We conclude that the boundary between short faint and long hard bursts is the result of data bias and poor attribute selection. We recommend that future gamma-ray burst classification avoid using extrinsic parameters such as fluence, and should instead concentrate on intrinsic properties such as spectral, temporal, and (when available) luminosity characteristics. Future classification should also be wary of correlated attributes (such as fluence and duration), as these bias classification results.
机译:我们使用无监督聚类算法对BATSE伽马射线暴进行分类,以便将分类与统计聚类技术进行比较。使用四种无监督算法(概念层次分类器ESX,EM算法,Kmeans算法和kohonen神经网络)对以均质触发标准检测到并以有限属性集(持续时间,硬度和能量密度)测量的BATSE突发进行分类。分类器更喜欢三级解决方案,而不是两级和四级解决方案。当被迫找到两个类别时,分类器不会找到传统的长和短类别。许多短暂的软事件被安排在短暂的硬突发中。当找到三个类别时,分类器可以清楚地识别出短脉冲串,但是与使用统计聚类技术所发现的相比,在中等持续时间的软类别中放置的成员要多得多。看起来,短暂的暗淡和长亮脉冲之间的边界对分类器比短的硬和长软脉冲之间的边界更为重要。我们得出的结论是,短时模糊和长时硬突发之间的边界是数据偏差和较差的属性选择的结果。我们建议将来的伽马射线暴分类避免使用诸如通量之类的外部参数,而应专注于固有特性,例如光谱,时间和(如果可用)发光度特性。将来的分类也应警惕相关属性(例如通量和持续时间),因为这些偏差会导致分类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号