首页> 外文期刊>Journal of Information Science >Sample size and informetric model goodness-of-fit outcomes: a search engine log case study
【24h】

Sample size and informetric model goodness-of-fit outcomes: a search engine log case study

机译:样本量和信息量模型拟合优度结果:一个搜索引擎日志案例研究

获取原文
获取原文并翻译 | 示例
           

摘要

The influence of sample size on informetric characteristics is examined to determine whether theoretical mathematical models can adequately fit large data sets. Two large data sets of queries submitted to the Excite search service were sampled for search characteristics (term frequencies, terms used per query, pages viewed per query, queries submitted per session) producing data sets of various sizes that were fitted to theoretical models to determine how the sample may influence a model's goodness-of-fit. Although theoretical models could adequately fit smaller data sets of up to 5000 observations in some cases, larger data sets could not be satisfactorily fitted using several goodness-of-fit techniques. Investigators must take into account that sample size does influence goodness-of-fit outcomes. The nature of the data and not the limitations of given goodness-of-fit tests results in significant outcomes. Such goodness-of-fit tests should be used for comparative purposes, rather than significance testing.
机译:检查样本大小对信息计量学特征的影响,以确定理论数学模型是否可以充分适合大型数据集。抽样了两个提交给Excite搜索服务的查询的大型数据集,以获取搜索特征(术语频率,每个查询使用的术语,每个查询查看的页面,每个会话提交的查询),生成各种大小的数据集,这些数据集适合理论模型以确定样本如何影响模型的拟合优度。尽管在某些情况下,理论模型可以适当地拟合多达5000个观测值的较小数据集,但使用几种拟合优度技术无法令人满意地拟合较大数据集。研究人员必须考虑到样本量确实会影响拟合优度结果。数据的性质而不是给定拟合优度检验的局限性导致了显着的结果。这种拟合优度测试应用于比较目的,而不是重要性测试。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号