Statistical Hypothesis Testing in Positive Unlabelled Data

机译：未标记阳性数据的统计假设检验

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We propose a set of novel methodologies which enable valid statistical hypothesis testing when we have only positive and unlabelled (PU) examples. This type of problem, a special case of semi-supervised data, is common in text mining, bioinformatics, and computer vision. Focusing on a generalised likelihood ratio test, we have 3 key contributions: (1) a proof that assuming all unlabelled examples are negative cases is sufficient for independence testing, but not for power analysis activities; (2) a new methodology that compensates this and enables power analysis, allowing sample size determination for observing an effect with a desired power; and finally, (3) a new capability, supervision determination, which can determine a-priori the number of labelled examples the user must collect before being able to observe a desired statistical effect. Beyond general hypothesis testing, we suggest the tools will additionally be useful for information theoretic feature selection, and Bayesian Network structure learning.

机译：我们提出了一套新颖的方法论，当我们只有阳性和未标记的（PU）实例时，它们可以进行有效的统计假设检验。这种类型的问题是半监督数据的一种特殊情况，在文本挖掘，生物信息学和计算机视觉中很常见。着眼于广义似然比检验，我们有3个主要贡献：（1）证明假设所有未标记的示例都是负面案例，足以进行独立性测试，但不足以进行功效分析活动; （2）一种新的方法，可以对此进行补偿，并能够进行功效分析，从而可以确定样本大小，以观察具有所需功效的效果;最后，（3）一种新的功能，监督确定，可以先验确定用户在能够观察到所需的统计效果之前必须收集的带标签示例的数量。除了一般的假设检验之外，我们建议这些工具还可以用于信息理论特征选择和贝叶斯网络结构学习。

著录项

来源
《European conference on machine learning and knowledge discovery in databases》|2014年|66-81|共16页
会议地点
作者
Konstantinos Sechidis; Borja Calvo; Gavin Brown;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Perspectives on the Use of Null Hypothesis Statistical Testing. Part II: Is Null Hypothesis Statistical Testing an Irregular Bulk of Masonry? [J] . Marmolejo-Ramos Fernando, Cousineau Denis Educational and Psychological Measurement . 2017,第4期

机译：零假假设统计测试使用的透视图。第II部分：是不规则的大部分砌体的空假设统计测试？
2. Perspectives on the Use of Null Hypothesis Statistical Testing. Part III: The Various Nuts and Bolts of Statistical and Hypothesis Testing [J] . Fernando Marmolejo-Ramos, Denis Cousineau Educational and Psychological Measurement . 2017,第5期

机译：零假假设统计测试使用的透视图。第三部分：统计和假设检测的各种螺母和螺栓
3. Making Decisions with Data: Understanding Hypothesis Testing & Statistical Significance [J] . Cooper Robert A. The American Biology Teacher: Journal of the National Association of Biology Teachers . 2019,第8期

机译：与数据做出决定：了解假设检测和统计显着性
4. Statistical Hypothesis Testing in Positive Unlabelled Data [C] . Konstantinos Sechidis, Borja Calvo, Gavin Brown European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases . 2014

机译：积极未标记数据中的统计假设检验
5. Statistical hypothesis testing and application to biological data. [D] . Birkner, Merrill Dobbel. 2006

机译：统计假设检验并将其应用于生物学数据。
6. Perspectives on the Use of Null Hypothesis Statistical Testing. Part II: Is Null Hypothesis Statistical Testing an Irregular Bulk of Masonry? [O] . Fernando Marmolejo-Ramos, Denis Cousineau 2017

机译：使用零假设统计检验的观点。第二部分：零假设统计检验是否是不规则的砌体？
7. Statistical Hypothesis Testing in Positive Unlabelled Data [O] . Konstantinos Sechidis, Borja Calvo, Gavin Brown 2015

机译：正无标记数据中的统计假设检验

Statistical Hypothesis Testing in Positive Unlabelled Data

摘要

著录项

相似文献

相关主题

期刊订阅