首页> 外文学位 >Bayesian data mining techniques in public health and biomedical applications.
【24h】

Bayesian data mining techniques in public health and biomedical applications.

机译:公共卫生和生物医学应用中的贝叶斯数据挖掘技术。

获取原文
获取原文并翻译 | 示例

摘要

This thesis is focused on Bayesian data mining techniques for non-traditional data in the health care and biomedical fields, and comprises three main topics.;Meta-analysis of clinical trials with incomplete data set.;This methodology is motivated by meta-analysis that compares the effectiveness of glass-ionomer (GI) to resin-based (RB) sealants in preventing caries in permanent molars of children. While GI and RB sealants are two of the most commonly used dental sealant materials, there is strong debate regarding their comparative effectiveness. Although GI to RB sealant meta-analyses have been performed in the past, this research is the first to find difference by including studies that were excluded previously solely because they failed to report data in a paired manner. The conclusion is based on 12 clinical studies performed on 2,012 children, that had incomplete information and inconsistent conclusions.;To retrieve full paired table information from studies that only reported marginal totals, we propose a Bayesian approach based on Metropolis random walk and a hierarchical Bayesian model. The procedure relies on Markov chain Monte Carlo simulation, recovers a full table and reports its accuracy in terms of credible sets for the cell counts. The performance of the model is remarkable; from 17 matched-pair study examples, four of the estimated tables coincided exactly with the original tables and nine tables deviated only (+1,-1) in terms of cell counts.;Analysis of spatial repeated measures with excess zero and no covariate information.;Water fluoridation, the addition of fluoride to drinking water to prevent tooth decay, is listed as one of ten greatest public health achievements of the 20th century by the Centers for Disease Control and Prevention. However, fluoride is present naturally in water in concentrations well above recommended levels, which may have several long-term adverse effects. Therefore, describing the availability of natural fluoride by county can inform public health efforts in prevention of tooth decay and dental fluorosis.;This work is to estimate United States county level natural fluoride occurrence in the ground water. To accommodate the limitations of the data, we propose a two-step hierarchical Bayesian approach. It models spatial repeated measures with excess zeroes and it is robust even when covariate information is not available. The approach starts with: (i) an autologistic model that estimates the probability of zero observations, and proceeds to (ii) an excess-zero model that describes the underlying physical phenomena of the zero-inflated data. The proposed hierarchical Bayesian structure produces improved discretized estimates and reflects the continuous behavior of the natural fluoride occurrence better. By employing the hyper prior structure, state level estimates can be obtained as well. Furthermore, the model can answer pragmatic questions such as the probability that a specific county has naturally occurring fluoride above EPA recommended level".;Image classification based on the overall image regularity in wavelet domain.;Breast cancer is one of the most common forms of cancer among women in the United States; an estimated 1 in 8 women born today will be diagnosed during her lifetime. Since the causes of breast cancer are not yet fully understood, early detection is still the best strategy for improving prognoses. Mammography is currently the most effective method for detecting breast cancer early; however, radiological interpretation of mammogram images is a challenging task. The appearance of even normal tissue is highly variable and complex, and signs of early disease are often indistinct.;We propose a diagnostic based on the properties of overall image backgrounds; this procedure currently is an unused diagnostic modality in mammograms. Generally speaking, normal/healthy images tend to be more irregular than cancer images. The overall regularity of the image is assessed through wavelet analysis, which is then summarized by a few measures. These estimators are evaluated based on their ability to classify digitized mammogram images from a clinical database, for which the true disease status is known by biopsy. This research presents a two-fold approach: (i) generalization of the covariance wavelet spectra to the complex domain and (ii) the estimation of Hurst parameter and phase information as discriminatory descriptors. The most accurate classification rates from this work achieve 86%; these rates vary slightly with the choice of wavelet basis, levels used and size of training set.
机译:本文主要针对医疗保健和生物医学领域的非传统数据的贝叶斯数据挖掘技术,并包括三个主要主题。不完整数据集的临床试验的元分析。比较了玻璃离聚物(GI)和树脂基(RB)密封胶在预防儿童恒磨牙中龋齿的功效。虽然GI和RB密封胶是最常用的两种牙科密封胶材料,但它们的相对有效性仍存在争议。尽管过去已经进行了GI到RB密封剂的荟萃分析,但这项研究是第一个发现差异的方法,该研究纳入了以前被排除在外的研究,完全是因为它们未能以成对的方式报告数据。该结论基于对1,012名儿童进行的12项临床研究,这些研究信息不完整且结论不一致。;要从仅报告边际总数的研究中检索完全配对的表信息,我们提出了一种基于Metropolis随机游走和分级贝叶斯的贝叶斯方法模型。该过程依赖于马尔可夫链蒙特卡罗模拟,恢复完整表格并以可靠的细胞计数集报告其准确性。该模型的性能非常出色;从17个配对研究示例中,估计出的四个表与原始表完全重合,而九个表在单元格计数方面仅相差(+ 1,-1)。 ..水氟化,即在饮用水中添加氟化物以​​防止蛀牙,被疾病控制与预防中心列为20世纪十大公共卫生成就之一。但是,水中的氟化物天然存在的浓度远高于建议的浓度,这可能会产生一些长期的不利影响。因此,描述各县的天然氟化物的可用性可以为预防牙齿龋齿和氟中毒的公共卫生工作提供信息。这项工作是为了估计美国县级地下水中天然氟化物的发生。为了适应数据的局限性,我们提出了两步分层贝叶斯方法。它使用多余的零对空间重复测量进行建模,即使没有协变量信息,它也很健壮。该方法始于:(i)估计零观测值概率的自物流模型,然后进行到(ii)描述零膨胀数据的潜在物理现象的超零模型。提出的分层贝叶斯结构产生了改进的离散估计,并更好地反映了天然氟化物的连续行为。通过采用超先验结构,也可以获得状态级别估计。此外,该模型还可以回答一些实用的问题,例如特定郡县自然产生的氟超过EPA推荐水平的可能性。”;基于小波域总体图像规律性的图像分类。;乳腺癌是最常见的一种形式在美国女性中,癌症的发生;据估计,目前有八分之一的女性今天将在其一生中被诊断出,由于尚未完全了解乳腺癌的原因,因此,早期发现仍是改善预后的最佳策略。早期发现乳腺癌的最有效方法;但是,对乳房X线照片进行放射学解释是一项艰巨的任务,即使正常组织的外观也高度可变且复杂,而且早期疾病的体征通常难以区分。总体图像背景的属性;此过程当前是乳房X线照片中未使用的诊断方式。 l /健康图像往往比癌症图像更不规则。图像的整体规律性通过小波分析进行评估,然后通过一些措施进行总结。这些估计器是根据其对临床数据库中的数字化乳房X线照片图像进行分类的能力进行评估的,对于这些估计值,活检可以得知真正的疾病状态。这项研究提出了两种方法:(i)将协方差小波谱推广到复数域,(ii)估计Hurst参数和相位信息作为区分性描述符。这项工作中最准确的分类率达到86%;这些频率随小波基础,使用的级别和训练集的大小的选择而略有不同。

著录项

  • 作者

    Jeon, Seonghye.;

  • 作者单位

    Georgia Institute of Technology.;

  • 授予单位 Georgia Institute of Technology.;
  • 学科 Statistics.;Engineering Industrial.;Health Sciences Public Health.
  • 学位 Ph.D.
  • 年度 2012
  • 页码 113 p.
  • 总页数 113
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号