首页> 外国专利> SUBSAMPLING TRAINING DATA DURING ARTIFICIAL NEURAL NETWORK TRAINING

SUBSAMPLING TRAINING DATA DURING ARTIFICIAL NEURAL NETWORK TRAINING

机译：人工神经网络训练期间的采样训练数据

页面导航

摘要
著录项
相似文献

摘要

Perplexity scores are computed for training data samples during ANN training. Perplexity scores can be computed as a divergence between data defining a class associated with a current training data sample and a probability vector generated by the ANN model. Perplexity scores can alternately be computed by learning a probability density function ("PDF") fitting activation maps generated by an ANN model during training. A perplexity score can then be computed for a current training data sample by computing a probability for the current training data sample based on the PDF. If the perplexity score for a training data sample is lower than a threshold, the training data sample is removed from the training data set so that it will not be utilized for training during subsequent epochs. Training of the ANN model continues following the removal of training data samples from the training data set.

机译：计算ANN训练期间训练数据样本的困惑度分数。困惑度分数可以计算为定义与当前训练数据样本关联的类别的数据与ANN模型生成的概率向量之间的差异。可以通过学习训练期间由ANN模型生成的拟合激活图的概率密度函数（“ PDF”）来替代地计算困惑度分数。然后可以通过基于PDF计算当前训练数据样本的概率来计算当前训练数据样本的困惑度分数。如果训练数据样本的困惑度分数低于阈值，则将训练数据样本从训练数据集中删除，以使其在后续时期不会被用于训练。从训练数据集中删除训练数据样本之后，将继续对ANN模型进行训练。

著录项

公开/公告号WO2020190666A1

专利类型
公开/公告日2020-09-24

原文格式PDF
申请/专利权人 MICROSOFT TECHNOLOGY LICENSING LLC;
展开▼

申请/专利号WO2020US22507
发明设计人 CHUNG ERIC S.;BURGER DOUGLAS C.;DARVISH ROUHANI BITA;
展开▼

申请日2020-03-13
分类号G06K9/46;G06K9/62;G06N20;
国家 WO
入库时间 2022-08-21 11:09:20

相似文献

专利
外文文献
中文文献