首页> 外文期刊>International Journal of Qualitative Methods >Natural Language Processing (NLP) in Qualitative Public Health Research: A Proof of Concept Study
【24h】

Natural Language Processing (NLP) in Qualitative Public Health Research: A Proof of Concept Study

机译:定性公共卫生研究中的自然语言处理(NLP):概念研究证明

获取原文
           

摘要

Qualitative data-analysis methods provide thick, rich descriptions of subjects’ thoughts, feelings, and lived experiences but may be time-consuming, labor-intensive, or prone to bias. Natural language processing (NLP) is a machine learning technique from computer science that uses algorithms to analyze textual data. NLP allows processing of large amounts of data almost instantaneously. As researchers become conversant with NLP, it is becoming more frequently employed outside of computer science and shows promise as a tool to analyze qualitative data in public health. This is a proof of concept paper to evaluate the potential of NLP to analyze qualitative data. Specifically, we ask if NLP can support conventional qualitative analysis, and if so, what its role is. We compared a qualitative method of open coding with two forms of NLP, Topic Modeling, and Word2Vec to analyze transcripts from interviews conducted in rural Belize querying men about their health needs. All three methods returned a series of terms that captured ideas and concepts in subjects’ responses to interview questions. Open coding returned 5–10 words or short phrases for each question. Topic Modeling returned a series of word-probability pairs that quantified how well a word captured the topic of a response. Word2Vec returned a list of words for each interview question ordered by which words were predicted to best capture the meaning of the passage. For most interview questions, all three methods returned conceptually similar results. NLP may be a useful adjunct to qualitative analysis. NLP may be performed after data have undergone open coding as a check on the accuracy of the codes. Alternatively, researchers can perform NLP prior to open coding and use the results to guide their creation of their codebook.
机译:定性数据分析方法提供厚重,丰富的受试者的思想,感受和生活经验,但可能是耗时的,劳动密集型或易于偏见。自然语言处理(NLP)是一种计算机科学的机器学习技术,它使用算法来分析文本数据。 NLP几乎瞬间处理大量数据。随着研究人员与NLP进行了熟悉,它越来越经常在计算机科学外部使用,并将承诺作为分析公共卫生的定性数据的工具。这是概念纸的证据,以评估NLP的潜力来分析定性数据。具体而言,我们询问NLP是否可以支持传统的定性分析,如果是,它的作用是什么。我们比较了具有两种形式的NLP,主题建模和Word2VEC的开放编码的定性方法,以分析来自农村伯利兹查询男性的采访的成绩单。所有三种方法都返回了一系列术语,即在受试者的回答中捕获了思想和概念来面试问题。开放编码为每个问题返回5-10个单词或短语。主题建模返回了一系列单词概率对,这些对量化了一个单词捕获响应主题的单词。 Word2VEC返回了每个面试问题的单词列表,其中按照预先捕获该词最佳捕获该段落的含义。对于大多数面试问题,所有三种方法都归还了概念上类似的结果。 NLP可能是有用的定性分析的辅助。可以在数据经历开放编码后执行NLP作为检查代码的准确性。或者,研究人员可以在打开编码之前执行NLP,并使用结果指导他们创建其码本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号