首页> 外文期刊>BMC Medical Research Methodology >Application of latent semantic analysis for open-ended responses in a large, epidemiologic study
【24h】

Application of latent semantic analysis for open-ended responses in a large, epidemiologic study

机译:潜在语义分析在大型流行病学研究中开放式反应的应用

获取原文
           

摘要

Background The Millennium Cohort Study is a longitudinal cohort study designed in the late 1990s to evaluate how military service may affect long-term health. The purpose of this investigation was to examine characteristics of Millennium Cohort Study participants who responded to the open-ended question, and to identify and investigate the most commonly reported areas of concern. Methods Participants who responded during the 2001-2003 and 2004-2006 questionnaire cycles were included in this study (n = 108,129). To perform these analyses, Latent Semantic Analysis (LSA) was applied to a broad open-ended question asking the participant if there were any additional health concerns. Multivariable logistic regression was performed to examine the adjusted odds of responding to the open-text field, and cluster analysis was executed to understand the major areas of concern for participants providing open-ended responses. Results Participants who provided information in the open-ended text field (n = 27,916), had significantly lower self-reported general health compared with those who did not provide information in the open-ended text field. The bulk of responses concerned a finite number of topics, most notably illness/injury, exposure, and exercise. Conclusion These findings suggest generalized topic areas, as well as identify subgroups who are more likely to provide additional information in their response that may add insight into future epidemiologic and military research.
机译:背景技术千年队列研究是一项纵向队列研究,在20世纪90年代后期设计,以评估军事服务如何影响长期健康。该调查的目的是审查千年队列研究参与者的特征,他们回应了开放式问题,并确定并调查了最常见的关注领域。方法在本研究中包含在2001-2003和2004-2006问卷周期中回复的参与者(n = 108,129)。为了执行这些分析,应用潜在语义分析(LSA)在询问参与者是否存在任何额外的健康问题时应用于广泛的开放式问题。进行多变量逻辑回归以检查对开放文本领域的响应的调整赔率,并执行集群分析,以了解提供开放响应的参与者的主要领域。结果参与者在未结束的文本字段(n = 27,916)中提供的参与者与未在未结束的文本领域提供信息的人相比,自我报告的一般健康状况明显降低。大部分反应涉及有限数量的主题,最符合疾病/伤害,暴露和运动。结论这些调查结果提出了广泛的主题领域,以及识别更有可能在其答复中提供额外信息的子群,这可能会深入了解未来流行病学和军事研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号