Ballpark Learning: Estimating Labels from Rough Group Comparisons


Abstract

We are interested in estimating individual labels given only coarse, aggregated signal over the data points. In our setting, we receive sets ("bags") of unlabeled instances with constraints on label proportions. We relax the unrealistic assumption, made in previous work, that label proportions are known; instead, we assume only upper and lower bounds on the proportions, together with constraints on differences between bags. We motivate the problem, propose an intuitive formulation and algorithm, and apply our methods to real-world scenarios. Across several domains, we show that surprisingly high accuracy can be achieved using only proportion constraints and no labeled examples. In particular, we demonstrate how to predict income level using rough stereotypes and how to perform sentiment analysis using very little information. We also apply our method to guide exploratory analysis, recovering geographical differences in Twitter dialect.
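The kind of supervision described in the abstract can be illustrated with a minimal sketch. It is not the authors' formulation or algorithm; it only shows, under stated assumptions, how bounds on a bag's label proportion and a constraint on the difference between two bags might be checked against per-instance scores. The names `bag_proportion` and `constraint_violation`, the 0.5 threshold, the hinge-style penalty, and the toy data are all hypothetical.

```python
from statistics import mean


def bag_proportion(scores, threshold=0.5):
    """Fraction of instances in a bag whose score exceeds the threshold."""
    return mean(1.0 if s > threshold else 0.0 for s in scores)


def constraint_violation(bags, bounds, diffs, threshold=0.5):
    """Total amount by which predicted proportions violate the constraints.

    bags   : dict mapping bag name -> list of per-instance scores in [0, 1]
    bounds : dict mapping bag name -> (lower, upper) bound on the positive proportion
    diffs  : list of (a, b, margin), meaning proportion(a) - proportion(b) >= margin
    All three structures are illustrative assumptions, not taken from the paper.
    """
    p = {name: bag_proportion(s, threshold) for name, s in bags.items()}
    total = 0.0
    # Penalize proportions that fall outside their [lower, upper] interval.
    for name, (lo, hi) in bounds.items():
        total += max(0.0, lo - p[name]) + max(0.0, p[name] - hi)
    # Penalize pairs of bags whose proportion gap is smaller than the margin.
    for a, b, margin in diffs:
        total += max(0.0, margin - (p[a] - p[b]))
    return total


# Toy data: bag "A" is believed to be mostly positive, bag "B" mostly negative.
bags = {"A": [0.9, 0.8, 0.2, 0.7], "B": [0.1, 0.3, 0.6, 0.2]}
bounds = {"A": (0.5, 1.0), "B": (0.0, 0.4)}
diffs = [("A", "B", 0.3)]
violation = constraint_violation(bags, bounds, diffs)
```

In this toy setup the scores satisfy every constraint, so the violation is zero; a learner could in principle adjust its scores to drive such a penalty down, though how the paper actually does this is not stated in the abstract.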


Bibliographic record

Source: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2016, pp. 299-314 (16 pages)
Authors: Tom Hope; Dafna Shahaf
Original format: PDF

