A Review of Best Practice Recommendations for Text Analysis in R (and a User-Friendly App)

Banks George C.; Woznyj Haley M.; Wesslen Ryan S.; Ross Roxanne L.

首页> 外文期刊>Journal of business and psychology fsponsored by the Business Psychology Research Institute >A Review of Best Practice Recommendations for Text Analysis in R (and a User-Friendly App)

【24h】

A Review of Best Practice Recommendations for Text Analysis in R (and a User-Friendly App)

机译：R（以及用户友好的应用程序的文本分析最佳实践建议述评

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In recent decades, the amount of text available for organizational science research has grown tremendously. Despite the availability of text and advances in text analysis methods, many of these techniques remain largely segmented by discipline. Moreover, there is an increasing number of open-source tools (R, Python) for text analysis, yet these tools are not easily taken advantage of by social science researchers who likely have limited programming knowledge and exposure to computational methods. In this article, we compare quantitative and qualitative text analysis methods used across social sciences. We describe basic terminology and the overlooked, but critically important, steps in pre-processing raw text (e.g., selection of stop words; stemming). Next, we provide an exploratory analysis of open-ended responses from a prototypical survey dataset using topic modeling with R. We provide a list of best practice recommendations for text analysis focused on (1) hypothesis and question formation, (2) design and data collection, (3) data pre-processing, and (4) topic modeling. We also discuss the creation of scale scores for more traditional correlation and regression analyses. All the data are available in an online repository for the interested reader to practice with, along with a reference list for additional reading, an R markdown file, and an open source interactive topic model tool (topicApp; see https://github.com/wesslen/topicApp, https://github.com/wesslen/text-analysis-org-science, https://dataverse.unc.edu/dataset.xhtml?persistentId=doi:10.15139/S3/R4W7ZS).

机译：近几十年来，可用于组织科学研究的文本数量巨大地增长。尽管文本分析方法中的文本和进步，但这些技术中的许多技术仍然很大程度上被纪律分割。此外，文本分析存在越来越多的开源工具（R，Python），但这些工具不容易受到社会科学研究人员的优势，他们可能具有有限的编程知识和曝光计算方法。在本文中，我们比较了社会科学中使用的定量和定性文本分析方法。我们描述了基本术语和被忽视但批判性的重要性，步骤在预处理原始文本（例如，选择停止单词;茎干）。接下来，我们提供使用与R主题建模的原型调查数据集的开放式响应的探索性分析。我们提供了专注于（1）假设和问题的文本分析的最佳实践建议列表，（2）设计和数据集合，（3）数据预处理，（4）主题建模。我们还讨论了更传统的相关性和回归分析的规模分数的创建。所有数据都可以在线存储库中使用，用于感兴趣的读者才能与参考列表一起练习，以及其他读取，R Markdown文件和开源交互式主题模型工具（TopicApp;查看https://github.com / wesslen /主题点，https://github.com/wesslen/text-analysis-orce，https：//dataverse.unc.edu/dataset.xhtml?persistentid=doi:10.15139/s3/r4w7zs）。

著录项

来源
《Journal of business and psychology fsponsored by the Business Psychology Research Institute》 |2018年第4期|共15页
作者
Banks George C.; Woznyj Haley M.; Wesslen Ryan S.; Ross Roxanne L.;
展开▼
作者单位

Univ N Carolina Belk Coll Business Dept Management 9201 Univ City Blvd Charlotte NC 28223 USA;

Longwood Univ Dept Management Farmville VA USA;

Univ N Carolina Dept Comp Sci Charlotte NC 28223 USA;

Univ N Carolina Dept Org Sci Charlotte NC 28223 USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类心理学;
关键词
Text analysis; Topic modeling; Structural topic modeling; Thematic analysis; Content-analysis; Dictionary analysis; Natural language processing;

机译：文本分析;主题建模;结构主题建模;主题分析;内容分析;字典分析;自然语言处理;

相似文献

外文文献
中文文献
专利

1. A Review of Best Practice Recommendations for Text Analysis in R (and a User-Friendly App) [J] . Banks George C., Woznyj Haley M., Wesslen Ryan S., Journal of business and psychology fsponsored by the Business Psychology Research Institute . 2018,第4期

机译：R（以及用户友好的应用程序的文本分析最佳实践建议述评
2. Identifying patient-centred recommendations for improving patient safety in General Practices in England: a qualitative content analysis of free-text responses using the Patient Reported Experiences and Outcomes of Safety in Primary Care (PREOS-PC) questionnaire [J] . Ricci-Cabello Ignacio, Saletti-Cuesta Lorena, Slight Sarah P., Health expectations: an international journal of public participation in health care and health policy . 2017,第5期

机译：确定患者为改善英格兰的一般实践中的患者安全的患者安全性建议：使用患者的自由文本反应的定性内容分析报告初级保健（PROS-PC）问卷中的安全经验和结果
3. Survey of radioiodine therapy safety practices highlights the need for user-friendly recommendations. [J] . Kloos RT Thyroid: official journal of the American Thyroid Association . 2011,第2期

机译：放射性碘疗法安全性实践调查突出了对用户友好建议的需求。
4. Accident analysis in practice: A review of Human Factors Analysis and Classification System (HFACS) applications in the peer reviewed academic literature [C] . Adam Hulme, Neville A. Stanton, Guy H. Walker, Human Factors and Ergonomics Society Annual Meeting . 2019

机译：实践中的事故分析：对同行评审的人类因素分析和分类系统（HFACS）申请述评综述学术文献
5. Review of practices and recommendations for a database system for managing municipal roads (French text). [D] . Babb, Stephen. 2002

机译：审查用于管理市政道路的数据库系统的做法和建议（法语文本）。
6. Identifying patient‐centred recommendations for improving patient safety in General Practices in England: a qualitative content analysis of free‐text responses using the Patient Reported Experiences and Outcomes of Safety in Primary Care (PREOS‐PC) questionnaire [O] . Ignacio Ricci‐Cabello, Lorena Saletti‐Cuesta, Sarah P. Slight, 2017

机译：在英格兰的通用实践中确定以患者为中心的改善患者安全的建议：使用患者报告的初级保健安全经验和结果（PREOS-PC）调查表对自由文本响应进行定性内容分析
7. Closed and Open Vocabulary Approaches to Text Analysis: A Review, Quantitative Comparison, and Recommendations [O] . johannes Christopher Eichstaedt, Margaret L. Kern, David Bryce Yaden, 2020

机译：文本分析的封闭和开放词汇方法：审查，定量比较和建议
8. Recommendations for Energy Conservation Standards and Guidelines for New Commercial Buildings. Volume 1. Text of the Recommendations. Appendix A. Side-by-Side Comparison of the Recommendations and 90A-1980 [R] . Pressnall, J. S., Fitzpatrick, J. J., Predecki, P. 1983

机译：关于节能标准和新商业建筑指南的建议。第1卷。建议书的文本。附录a.建议与90a-1980的并列比较

A Review of Best Practice Recommendations for Text Analysis in R (and a User-Friendly App)

摘要

著录项

相似文献

相关主题

期刊订阅