EVALUATING SAMPLING METHODS FOR REUSING KNOWLEDGE FROM LARGE AND ILL-STRUCTURED QUALITATIVE DATA SETS

机译：重用大量结构不良的定性数据集中的知识的评估采样方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The desire to use ever growing qualitative data sets of user generated content in the engineering design process in a computationally effective manner makes it increasingly necessary to draw representative samples. This work investigated the ability of alternative sampling algorithms to draw samples with conformance to characteristics of the original data set. Sampling methods investigated included: random sampling, interval sampling, fixed-increment (or systematic) sampling method, and stratified sampling. Data collected through the Vehicle Owner's Questionnaire, a survey administered by the U.S. National Highway Traffic Safety Administration, is used as a case study throughout this paper. The paper demonstrates that existing statistical methods may be used to evaluate goodness of fit for samples drawn from large bodies of qualitative data. Evaluation of goodness of fit not only provides confidence that a sample is representative of the data set from which it is drawn, but also yields valuable realtime feedback during the sampling process. This investigation revealed two interesting and counterintuitive trends in sampling algorithm performance. The first is that larger sample sizes do not necessarily lead to improved goodness of fit. The second is that depending on the details of implementation, data cleansing may degrade performance of data sampling algorithms rather than improving it. This work illustrates the importance of aligning sampling procedures to data structures and validating the conformance of samples to characteristics of the larger data set to avoid drawing erroneous conclusions based on unexpectedly biased samples of data.

机译：在工程设计过程中以计算上有效的方式使用用户生成的内容的不断增长的定性数据集的需求使得越来越有必要绘制代表性样本。这项工作研究了替代采样算法提取符合原始数据集特征的样本的能力。研究的抽样方法包括：随机抽样，间隔抽样，固定增量（或系统）抽样方法和分层抽样。通过“车主问卷调查”（由美国国家公路交通安全管理局管理的一项调查）收集的数据在整个本文中均用作案例研究。本文证明，现有的统计方法可用于评估从大量定性数据中提取的样本的拟合优度。拟合优度的评估不仅可以确保样品代表从中抽取数据的代表，而且还可以在采样过程中提供有价值的实时反馈。这项调查揭示了采样算法性能方面的两个有趣且违反直觉的趋势。首先是较大的样本量并不一定会导致拟合优度的提高。第二个问题是，根据实现的细节，数据清理可能会降低而不是改善数据采样算法的性能。这项工作说明了使采样程序与数据结构保持一致，并验证样本与较大数据集的特征的一致性的重要性，以避免基于出乎意料的数据样本得出错误的结论。

著录项

来源
《Computers and information in engineering conference;ASME international design engineering technical conferences and computers and information in engineering conference》|2017年|V001T02A057.1-V001T02A057.10|共10页
会议地点
作者
Jacob Nelson; G. Austin Marrs; Greg Schmidt; Joseph A. Donndelinger; Robert L. Nagel;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. ) and Neuropsychology (NP). METHODS: The paper is based on survey data collected from a sample of GPs (n = 300) registered with the Irish College of General Practitioners (ICGP) and on qualitative data collected from a Focus Group (n = 7). RESULTS: GPs were more likely to blame themselves than either the health care system, their patients or family members for the late presentation of dementia in primary care. Stigma was a major obstacle preventing GPs from being more proactive in this area. Rur [J] . Journal of chemical neuroanatomy . 2009,第2期

机译：）和神经心理学（NP）。方法：本文基于从爱尔兰全科医生学院（ICGP）注册的GP（n = 300）样本中收集的调查数据以及从焦点小组（n = 7）中收集的定性数据。结果：与初级卫生保健中迟发性痴呆症相比，全科医生比医疗保健系统，其患者或家属更有可能自责。污名化是阻碍全科医生在这一领域更加积极主动的主要障碍。鲁尔
2. Clustering Methods with Qualitative Data: a Mixed-Methods Approach for Prevention Research with Small Samples [J] . Henry David, Dymnicki Allison B., Mohatt Nathaniel, Prevention science: the official journal of the Society for Prevention Research . 2015,第7期

机译：定性数据的聚类方法：小样本预防研究的混合方法
3. Knowledge synthesis methods for integrating qualitative and quantitative data: a scoping review reveals poor operationalization of the methodological steps [J] . Tricco Andrea C., Antony Jesmin, Soobiah Charlene, Journal of Clinical Epidemiology . 2016,第期

机译：关于整合定性和定量数据的知识综合方法：范围审查揭示了方法学措施的差
4. EVALUATING SAMPLING METHODS FOR REUSING KNOWLEDGE FROM LARGE AND ILL-STRUCTURED QUALITATIVE DATA SETS [C] . Jacob Nelson, G. Austin Marrs, Greg Schmidt, ASME International Design Engineering Technical Conferences . 2017

机译：评估用于重用大型和结构定性数据集的知识的采样方法
5. Evaluation of population genetic structuring sampling methods using simulated microsatellite data [D] . Reavill, David Allen 2014

机译：利用模拟微卫星数据评估种群遗传结构抽样方法
6. Clustering Methods with Qualitative Data: A Mixed Methods Approach for Prevention Research with Small Samples [O] . David Henry, Allison B. Dymnicki, Nathaniel Mohatt, -1

机译：定性数据的聚类方法：小样本预防研究的混合方法
7. Political Culture of Coastal Society in the Hinterland Area of Batam CityThis research examines the political culture of coastal communities in the face of Simultaneous Regional Elections of 2015 in Batam City. The purpose of this research is to analyze the types of political culture and the participation of coastal society in political activities, especially elections, because there is no research that discusses the political culture of coastal society in particular. Research that discusses the political life of coastal society has not been discussed, especially how the role and views of society on political life and government. In fact, coastal society are groups that will be affected and feel the consequences of these political activities. This research uses qualitative method. The respondents are selected by purposive sampling technique and data obtained by observation and in-depth interview. The findings in this study indicate that the type of coastal society culture is included in the type of participant political culture, in which the level of participation society in Simultaneous Regional Elections of 2015 is quite high and their knowledge on political activity is sufficient. Keywords: Political Culture; Coastal Society; Participant Political Culture [O] . Diah Ayu Pratiwi, Meri Enita Puspita Sari 2018

机译：蝙蝠侠中腹海地区沿海社会的政治文化研究了2015年在蝙蝠侠城同时参见沿海社区的政治文化。本研究的目的是分析政治文化的类型和沿海社会参与政治活动，特别是选举，因为没有研究尤其是沿海社会的政治文化。讨论沿海社会政治生活的研究尚未讨论，特别是如何对政治生活和政府的作用和意见。事实上，沿海社会是将受到影响和感受这些政治活动的后果的群体。该研究采用定性方法。受访者通过目的采样技术和通过观察和深入访谈获得的数据来选择。本研究中的研究结果表明，沿海社会文化的类型包括在参与者政治文化的类型中，其中参与社会在2015年同时区域选举中的参与社会的水平相当高，他们对政治活动的知识就足够了。关键词：政治文化;沿海社会;参与者政治文化

EVALUATING SAMPLING METHODS FOR REUSING KNOWLEDGE FROM LARGE AND ILL-STRUCTURED QUALITATIVE DATA SETS

摘要

著录项

相似文献

相关主题

期刊订阅