...
首页> 外文期刊>American Journal of Mathematics and Statistics >Comparison of Probit and Logit Models for Binary Response Variable with Applications to Birth Data in South-Western, Nigeria
【24h】

Comparison of Probit and Logit Models for Binary Response Variable with Applications to Birth Data in South-Western, Nigeria

机译:二进制响应变量的Probit和Logit模型的比较及其在尼日利亚西南部出生数据中的应用

获取原文

摘要

Logit and Probit models are members of generalized linear models that are widely used to estimate the functional relationship between binary response variable and predictors. Comparison of regression models for binary response variable could be complicated by the choice of link function. The focus of this study is to determine which of the models will perform better in fitting birth data. The second focus of this study is to also make a comparative study between Yoruba women in Igbo-Ora known for high rate of multiple births and Yoruba women in Ogbomoso with sparse record of multiple birth, with the aim of determining factors that are responsible for the high rate of multiple births among the Igbo-Ora women in South-Western Nigeria. Theoretical derivations of the two models were given. Binomial regression model was fitted to the birth data with logit and probit links which resulted into logit and probit regression models respectively. The pchisq value of both the logit and the probit models were in excess of 0.05, an indication that the models fitted the birth data well. Due to simplicity of interpretation of results for Logit model, it was focused on for interpretation. The predictors considered in the models were age, religion, parity, tribe and the interaction term between age and religion of women. The results showed that the logit of having multiple births was positively related to tribe and the interaction term age*religion , but was found to be negatively related to women’s age, religion and their parity . It was also discovered that the odds of having multiple births by Yoruba women in Igbo-Ora is about three times higher than Yoruba women in Ogbomoso. The marked difference between the odds of having multiple births by women in Igbo-Ora and women in Ogbomoso is an indication that an inherent factor was responsible for the high rate of multiple births in Igbo-Ora. The high rate of multiple births could be ascribed to their regular consumption of Ilasa and Amala or due to a hereditary factor peculiar to Igbo-Ora women. Religion of women was only marginally significant, in the presence of other predictors. Graphical and numerical diagnostic evaluations of the models conducted revealed that the models were good summary of the birth data. Criterion-based variable selection procedure was also considered in selecting predictors that produce the most parsimonious models for both logit and probit models. Based on the AIC values, the final logit and the final probit regression models contained only tribe variable as the predictor for the most parsimonious model. It was difficult to distinguish between the logit and the probit models on the Receiver Operating Characteristic (ROC) curve, showing that the two models are quite similar, though the logit appeared to yield better performance than the probit model. This is an indication that the logit model has slight better prediction than the probit model. The choice of link function between logit and probit therefore depends on the data generating the process; and largely the choice is subjective.
机译:Logit和Probit模型是广义线性模型的成员,该模型广泛用于估计二进制响应变量和预测变量之间的函数关系。选择链接函数可能会使二进制响应变量的回归模型的比较复杂。这项研究的重点是确定哪种模型在拟合出生数据方面表现更好。这项研究的第二个重点是,还要对因多胎出生率高而闻名的伊博-奥拉地区的约鲁巴族妇女和多胎生育记录稀疏的奥格博莫索岛的约鲁巴族妇女进行比较研究,目的是确定造成多胎出生的因素。尼日利亚西南部伊博厄-奥拉族妇女的高分娩率很高。给出了两种模型的理论推导。将二项式回归模型拟合到具有logit和probit链接的出生数据,分别得出logit和probit回归模型。 logit模型和概率模型的pchisq值均超过0.05,这表明这些模型很好地拟合了出生数据。由于对Logit模型的结果解释的简单性,因此将重点放在解释上。在模型中考虑的预测因素是年龄,宗教,平价,部落和妇女年龄与宗教之间的相互作用。结果表明,多胎的对数与部落和交互项 age * religion呈正相关,但与女性的年龄,宗教和其<我>平价。还发现在伊博奥拉(Ygbo-Ora)约鲁巴(Yoruba)妇女多胎的几率是奥博莫索(Ogbomoso)约鲁巴(Yoruba)妇女的三倍。伊博奥拉州妇女与奥博博莫索州妇女多胎的几率之间的显着差异表明,内在因素是伊博奥岛高产多胎的原因。多胎分娩的高比率可能归因于他们经常食用伊拉萨和伊玛拉或伊博-奥拉妇女特有的遗传因素。在其他预测因素的存在下,妇女的宗教信仰只占很小的比例。对模型进行的图形和数字诊断评估表明,这些模型很好地概括了出生数据。在选择对数模型和概率模型都产生最简约模型的预测变量时,还考虑了基于标准的变量选择程序。基于AIC值,最终logit和最终probit回归模型仅包含部落变量作为最简约模型的预测变量。在接收器工作特性(ROC)曲线上很难区分logit模型和probit模型,这表明两个模型非常相似,尽管logit看起来比probit模型具有更好的性能。这表明对数模型比预测模型具有更好的预测。因此,logit和probit之间的链接功能的选择取决于生成过程的数据。在很大程度上,选择是主观的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号