Probabilistic document retrieval systems consistent with the two Poisson independence model outperforms the binary independence model if the terms are distributed as described by the model's assumptions. The Two Poisson Effectiveness Hypothesis suggests that retrieval models based upon the two Poisson model will outperform binary independent models when used on a "real-world" database, where independence and two Poisson term occurrence distributions fail to hold, because the added information obtained from incorporating term frequency information will more than compensate for the non-Poisson distributions of terms. Searches of the MED1033 database suggest that if terms are not independent and frequencies of term occurrence are not distributed in a two Poisson manner, the binary independence sequential retrieval model outperforms the two Poisson independence retrieval model.
机译:WWW文档检索的概率和布尔IR模型的组合
机译:多媒体检索的生成概率模型:查询生成与文档生成
机译:基于本体的二进制分类方法,利用概率检索模型识别多记录Web文档
机译:信息检索中文档检索的概率模型分析
机译:主题模型和动态预测模型及其在文档检索和医疗保健中的应用。
机译:在医疗文档检索中纳入统计主题模型
机译:该文件是否相关?...可能:信息检索中的概率模型调查
机译:概率信息检索中的新模型