首页>
外国专利>
A method for training convolutional neural networks for image recognition using image conditioned mask language modeling
A method for training convolutional neural networks for image recognition using image conditioned mask language modeling
展开▼
机译:使用图像调节掩模语言建模训练图像识别卷积神经网络的方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
Problem to be solved: to provide a method of pre training convolutional neural network for image recognition based on mask language modeling.The method isEnter an image into a convolution neural network;Outputting a visual embedded tensor of a visual embedded vector from a convolution neural network;Step into tokens the captions;Randomly selecting one of the tokens in the list of masked tokens;Computing the latent expression of a token using a language model neural network;Step by step pooling visual embedded vectors in a visual embedded tensor;Steps to predict a masked token;Determining the prediction loss associated with the masked token;AndThe prediction loss is convoluted and propagated back to the neural network.Includes adjusting the parameter.Diagram
展开▼