首页>
外国专利>
METHOD FOR BALANCING DATASETS OF MULTI-CLASS INSTANCE DATA
METHOD FOR BALANCING DATASETS OF MULTI-CLASS INSTANCE DATA
展开▼
机译:多类实例数据的数据平衡方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
This disclosure describes a method for balancing datasets of instances in which each instancemay be labelled by a sequence, plurality or distribution of class labels. The disclosure includesperforming stochastic under-sampling (removal of dataset instances) and oversampling(replication of dataset instances) based on the distribution of classes in each instance, tominimize the ratio between the sizes of the minority class (i.e. class labelling the fewest framesacross all instances) and the majority class (i.e. class labelling the most frames across allinstances).
展开▼