首页> 外国专利> SUBSAMPLING TRAINING DATA DURING ARTIFICIAL NEURAL NETWORK TRAINING

SUBSAMPLING TRAINING DATA DURING ARTIFICIAL NEURAL NETWORK TRAINING

机译:人工神经网络训练期间的采样训练数据

摘要

Perplexity scores are computed for training data samples during ANN training. Perplexity scores can be computed as a divergence between data defining a class associated with a current training data sample and a probability vector generated by the ANN model. Perplexity scores can alternately be computed by learning a probability density function ("PDF") fitting activation maps generated by an ANN model during training. A perplexity score can then be computed for a current training data sample by computing a probability for the current training data sample based on the PDF. If the perplexity score for a training data sample is lower than a threshold, the training data sample is removed from the training data set so that it will not be utilized for training during subsequent epochs. Training of the ANN model continues following the removal of training data samples from the training data set.
机译:计算ANN训练期间训练数据样本的困惑度分数。困惑度分数可以计算为定义与当前训练数据样本关联的类别的数据与ANN模型生成的概率向量之间的差异。可以通过学习训练期间由ANN模型生成的拟合激活图的概率密度函数(“ PDF”)来替代地计算困惑度分数。然后可以通过基于PDF计算当前训练数据样本的概率来计算当前训练数据样本的困惑度分数。如果训练数据样本的困惑度分数低于阈值,则将训练数据样本从训练数据集中删除,以使其在后续时期不会被用于训练。从训练数据集中删除训练数据样本之后,将继续对ANN模型进行训练。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号