Appropriate analyses of bimodal substance use frequency outcomes: a mixture model approach

Lane F. Burgette; Irineo Cabreros; Bing HanSusan M. Paddock

摘要

Background: In addiction research, outcome measures are often characterized by bimodal distributions. One mode can be for individuals with low substance use and the other mode for individuals with high substance use. Applying standard statistical procedures to bimodal data may result in invalid inference. Mixture models are appropriate for bimodal data because they assume that the sampled population is composed of several underlying subpopulations.Objectives: To introduce a novel mixture modeling approach to analyze bimodal substance use frequency data.Methods: We reviewed existing models used to analyze substance use frequency outcomes and developed multiple alternative variants of a finite mixture model. We applied all methods to data from a randomized controlled study in which 30-day alcohol abstinence was the primary outcome. Study data included 73 individuals (38 men and 35 women). Models were implemented in the software packages SAS, Stata, and Stan.Results: Shortcomings of existing approaches include: 1) inability to model outcomes with multiple modes, 2) invalid statistical inferences, including anti-conservative p-values, 3) sensitivity of results to the arbitrary choice to model days of substance use versus days of substance abstention, and 4) generation of predictions outside the range of common substance use frequency outcomes. Our mixture model variants avoided all of these shortcomings.Conclusions: Standard models of substance use frequency outcomes can be problematic, sometimes overstating treatment effectiveness. The mixture models developed improve the analysis of bimodal substance use frequency.

机译：背景：在成瘾研究中，结果测量通常以双峰分布为特征。一种模式可以适用于物质使用量低的人，另一种模式适用于物质使用量高的人。对双峰数据应用标准统计程序可能会导致无效的推断。混合模型适用于双峰数据，因为它们假定采样总体由多个基础子群体组成。目的：引入一种新的混合物建模方法来分析双峰物质使用频率数据。方法：我们回顾了用于分析物质使用频率结果的现有模型，并开发了有限混合物模型的多种替代变体。我们将所有方法应用于一项随机对照研究的数据，其中30天戒酒是主要结局。研究数据包括73名受试者（38名男性和35名女性）。结果：现有方法的缺点包括：1）无法用多种模式对结果进行建模，2）无效的统计推论，包括反保守的p值，3）结果对任意选择的敏感性，以模拟物质使用天数与戒毒天数，以及4）生成超出常见物质使用频率结果范围的预测。我们的混合模型变体避免了所有这些缺点。结论：物质使用频率结果的标准模型可能存在问题，有时会夸大治疗效果。开发的混合物模型改进了对双峰物质使用频率的分析。

Appropriate analyses of bimodal substance use frequency outcomes: a mixture model approach

摘要

著录项

引文网络

相关主题

期刊订阅