首页> 外文期刊>Expert systems with applications >Estimating Confidence Intervals For Structural Differences Between Contrast Groups With Missing Data
【24h】

Estimating Confidence Intervals For Structural Differences Between Contrast Groups With Missing Data

机译:估计缺少数据的对比组之间结构差异的置信区间

获取原文
获取原文并翻译 | 示例

摘要

Difference detection is actual and extremely useful for evaluating a new medicine B against a specified disease by comparing to an old medicine A, which has been used to treat the disease for many years. The datasets generated by applying A and B to the disease are called contrast groups and, main differences between the groups are the mean and distribution differences, referred to structural differences in this paper. However, contrast groups are only two samples obtained by limited applications or tests on A and B, and may be with missing values. Therefore, the differences derived from the groups are inevitably uncertain. In this paper, we propose a statistically sound approach for measuring this uncertainty by identifying the confidence intervals of structural differences between contrast groups. This method is designed significantly against most of those applications whose exact data distributions are unknown a priori, and the data may also be with missing values. We apply our approach to UCI datasets to illustrate its power as a new data mining technique for, such as, distinguishing spam from non-spam emails; and the benign breast cancer from the malign one.
机译:差异检测是实际的,通过与已经治疗多年的旧药物A进行比较,对于评估针对特定疾病的新药物B极为有用。通过将A和B应用于该疾病而生成的数据集称为对比组,这些组之间的主要差异是均值和分布差异,在本文中称为结构差异。但是,对比组仅是通过有限的应用或对A和B进行测试而获得的两个样本,并且可能缺少值。因此,从各组得出的差异不可避免地不确定。在本文中,我们提出了一种统计上合理的方法,通过识别对比组之间结构差异的置信区间来测量这种不确定性。该方法是针对那些先验未知确切数据分布的大多数应用程序而设计的,这些数据可能还缺少值。我们将方法应用于UCI数据集,以说明其作为一种新数据挖掘技术的功能,例如,区分垃圾邮件和非垃圾邮件;和恶性肿瘤之一。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号