首页>
外国专利>
METHOD OF GENERATING TRAINING DATA FOR TRAINING NEURAL NETWORK, METHOD OF TRAINING NEURAL NETWORK AND USING NEURAL NETWORK FOR AUTONOMOUS OPERATIONS
METHOD OF GENERATING TRAINING DATA FOR TRAINING NEURAL NETWORK, METHOD OF TRAINING NEURAL NETWORK AND USING NEURAL NETWORK FOR AUTONOMOUS OPERATIONS
A method of generating training data for training a neural network, method of training a neural network and using a neural network for autonomous operations, related devices and systems. In one aspect, a neural network for autonomous operation of an object in an environment is trained. Policy values are generated based a sample data set. An approximate action-value function is generated from the policy values. A set of approximated policy values is generated using the approximate action-value function for all states in the sample data set for all possible actions. Attaining target for the neural network is calculated based on the approximated policy values. A training error is calculated as the difference between the training target and the policy value for the corresponding state-action pair in the sample data set. At least some of the parameters of the neural network are updated to minimize the training error.
展开▼