首页> 外文会议>International Joint Conference on Neural Networks >CIDMP: Completely Interpretable Detection of Malaria Parasite in Red Blood Cells using Lower-dimensional Feature Space
【24h】

CIDMP: Completely Interpretable Detection of Malaria Parasite in Red Blood Cells using Lower-dimensional Feature Space

机译:CIDMP:使用低维特征空间完全可解释地检测红细胞中的疟原虫

获取原文

摘要

Predicting if red blood cells (RBC) are infected with the malaria parasite is an important problem in Pathology. Recently, supervised machine learning approaches have been used for this problem, and they have had reasonable success. In particular, state-of-the-art methods such as Convolutional Neural Networks automatically extract increasingly complex feature hierarchies from the image pixels. While such generalized automatic feature extraction methods have significantly reduced the burden of feature engineering in many domains, for niche tasks such as the one we consider in this paper, they result in two major problems. First, they use a very large number of features (that may or may not be relevant) and therefore training such models is computationally expensive. Further, more importantly, the large feature-space makes it very hard to interpret which features are truly important for predictions. Thus, a criticism of such methods is that learning algorithms pose as opaque blackboxes to its users, in this case medical experts. The recommendation of such algorithms can be understood easily, but the reason for their recommendation is not clear. This is the problem of non-interpretability of the model, and the best-performing algorithms are usually the least interpretable. To address these issues, in this paper, we propose an approach to extract a very small number of aggregated features that are easy to interpret and compute, and empirically show that we obtain high prediction accuracy even with a significantly reduced feature-space.
机译:预测红细胞(RBC)是否感染疟疾寄生虫是病理学中的重要问题。最近,监督机器学习方法已被用于这个问题,而且他们取得了合理的成功。特别地,诸如卷积神经网络的最先进的方法,从图像像素自动提取越来越复杂的特征层次结构。虽然这种广泛的自动特征提取方法在许多域中的特征工程的负担显着降低,但对于我们在本文中考虑的利基任务,它们导致了两个主要问题。首先,他们使用非常大量的功能(可能或可能不是相关的),因此培训这种模型是计算昂贵的。此外,更重要的是,大型特征空间使得很难解释哪些功能对于预测来说真正重要。因此,对这种方法的批评是,在这种情况下,学习算法占据了不透明的黑盒,在这种情况下,在这种情况下,医学专家。可以容易地理解这种算法的建议,但他们推荐的原因尚不清楚。这是模型不可解释的问题,并且最好的性能算法通常是最不可解释的。为了解决这些问题,在本文中,我们提出了一种方法来提取易于解释和计算的非常少量的聚合特征,并且经验表明我们即使具有显着减少的特征空间,我们也能获得高预测准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号