Comparative study of pattern recognition, neural network and statistical regression approaches to information retrieval.

Abstract

This dissertation presents several new retrieval methods that combine Bayes' theorem with probability density estimation techniques. The new methods estimate the probability of relevance from a small set of statistical features characterizing document-query pairs, such as query length, within-document term frequency, and the number of matching terms between a document and a query.

The central task in computing the probability of relevance in the proposed methods is to infer, from training examples, the density functions of the feature vector in the relevant and irrelevant classes. Both parametric and non-parametric methods are employed to estimate these density functions.

A two-layer neural network is presented. It takes as input a feature vector representing a document-query pair and returns the probability of relevance. Simple and complex neural networks are compared for retrieval performance, and the results show that the more complex designs do not significantly outperform the simplest design.

The performance of seven retrieval methods is compared: linear discriminant, quadratic discriminant, k-nearest neighbor, kernel method, neural network, linear regression, and logistic regression. All seven methods are trained on a common training set and then applied to two large test sets, the TREC-5 test set and the TREC-6 test set.

The experimental results suggest that the seven retrieval methods may be divided into two groups. The first group consists of the logistic regression, linear regression, linear discriminant, and neural network methods; the second group consists of the quadratic discriminant, k-nearest neighbor, and kernel methods. The methods within the first group perform approximately equally well on the test sets, and any method in the first group outperforms any method in the second group. In addition to being less effective in retrieval, both the kernel method and the k-nearest neighbor method are computationally intensive.
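The general idea in the abstract, estimating class-conditional densities of a feature vector and applying Bayes' theorem to get a probability of relevance, can be illustrated with a minimal sketch. This is not the dissertation's implementation: it assumes a diagonal Gaussian as the parametric density, and the two features, the toy training pairs, and all function names are hypothetical choices made for illustration only.

```python
import math

def fit_gaussian(rows):
    """Per-feature mean and variance (diagonal Gaussian) for one class."""
    n = len(rows)
    dims = len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(dims)]
    # A small floor keeps the variance positive for constant features.
    variances = [
        max(sum((r[j] - means[j]) ** 2 for r in rows) / n, 1e-6)
        for j in range(dims)
    ]
    return means, variances

def log_density(x, params):
    """Log of the diagonal Gaussian density at feature vector x."""
    means, variances = params
    return sum(
        -0.5 * math.log(2 * math.pi * v) - (xi - m) ** 2 / (2 * v)
        for xi, m, v in zip(x, means, variances)
    )

def prob_relevant(x, rel_params, irr_params, prior_rel):
    """Bayes' theorem: P(R|x) = p(x|R)P(R) / (p(x|R)P(R) + p(x|not R)P(not R))."""
    log_rel = log_density(x, rel_params) + math.log(prior_rel)
    log_irr = log_density(x, irr_params) + math.log(1.0 - prior_rel)
    # Subtract the larger log term before exponentiating, for numerical stability.
    top = max(log_rel, log_irr)
    num = math.exp(log_rel - top)
    return num / (num + math.exp(log_irr - top))

# Invented toy data: (matching terms, within-document term frequency).
relevant = [(3, 5), (4, 6), (3, 7), (5, 6)]
irrelevant = [(1, 1), (0, 2), (1, 0), (2, 1), (1, 2), (0, 1)]

rel_params = fit_gaussian(relevant)
irr_params = fit_gaussian(irrelevant)
prior_rel = len(relevant) / (len(relevant) + len(irrelevant))

# Score two hypothetical document-query pairs and rank them by P(relevant | x).
candidates = {"doc_a": (4, 6), "doc_b": (1, 1)}
scores = {
    name: prob_relevant(x, rel_params, irr_params, prior_rel)
    for name, x in candidates.items()
}
ranking = sorted(scores, key=scores.get, reverse=True)
print(ranking, scores)
```

Swapping `fit_gaussian` and `log_density` for a kernel or k-nearest-neighbor density estimate would give a non-parametric variant of the same scheme, at the cost of touching the training examples at scoring time, which is in line with the abstract's remark that those two methods are computationally intensive.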

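Bibliographic record

Author: Chen, Aitao
Author affiliation: University of California, Berkeley
Degree-granting institution: University of California, Berkeley
Subjects: Library Science; Statistics; Information Science
Degree: Ph.D.
Year: 1998
Pages: 161
Original format: PDF
Language: English
Chinese Library Classification: Library science and librarianship; Statistics; Information and knowledge dissemination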
