首页> 外文会议>IEEE International Conference on Data Mining >A Probabilistic Approach for Learning with Label Proportions Applied to the US Presidential Election
【24h】

A Probabilistic Approach for Learning with Label Proportions Applied to the US Presidential Election

机译:一种带有标签比例的概率学习方法应用于美国总统大选

获取原文

摘要

Ecological inference (EI) is a classical problem from political science to model voting behavior of individuals given only aggregate election results. Flaxman et al. recently formulated EI as machine learning problem using distribution regression, and applied it to analyze US presidential elections. However, distribution regression unnecessarily aggregates individual-level covariates available from census microdata, and ignores known structure of the aggregation mechanism. We instead formulate the problem as learning with label proportions (LLP), and develop a new, probabilistic, LLP method to solve it. Our model is the straightforward one where individual votes are latent variables. We use cardinality potentials to efficiently perform exact inference over latent variables during learning, and introduce a novel message-passing algorithm to extend cardinality potentials to multivariate probability models for use within multiclass LLP problems. We show experimentally that LLP outperforms distribution regression for predicting individual-level attributes, and that our method is as good as or better than existing state-of-the-art LLP methods.
机译:生态推理(EI)是从政治科学到仅给定总体选举结果的个人投票行为模型的经典问题。 Flaxman等。最近使用分布回归将EI表示为机器学习问题,并将其用于分析美国总统选举。但是,分布回归不必要地汇总了可从普查微数据中获得的个人级别协变量,并且忽略了汇总机制的已知结构。相反,我们将问题表述为使用标签比例(LLP)学习,并开发一种新的概率LLP方法来解决该问题。我们的模型是一种简单明了的模型,其中个人选票是潜在变量。我们使用基数势来在学习过程中有效地对潜在变量进行精确推断,并引入一种新颖的消息传递算法,以将基数势扩展到用于多类LLP问题的多元概率模型。我们通过实验表明,LLP在预测个人级别的属性方面优于分布回归,并且我们的方法与现有的最新LLP方法一样好或更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号