Inventors:
- Mountain View CA, US
Jun Gong - Mountain View CA, US
Marc Gendron-Bellemare - Montreal, CA
Assignee:
LOON LLC - Mountain View CA
International Classification:
G05D 1/00
G06N 3/04
G06N 3/08
G05D 1/02
B64B 1/00
Abstract:
The technology relates to navigating aerial vehicles using deep reinforcement learning techniques to generate flight policies. An operational system for controlling flight of an aerial vehicle may include a computing system configured to process an input vector representing a state of the aerial vehicle and output an action, an operation-ready policies server configured to store a trained neural network encoding a learned flight policy, and a controller configured to control the aerial vehicle. The input vector may be processed using the trained neural network encoding the learned flight policy. A method for navigating an aerial vehicle may include selecting a trained neural network encoding a learned flight policy from an operation policies server, generating an input vector comprising a set of characteristics representing a state of the aerial vehicle, selecting an action, by the trained neural network, based on the input vector, converting the action into a set of commands, by a flight computer, the set of commands configured to cause the aerial vehicle to perform the action, and causing, by a controller, the aerial vehicle to perform the action using the set of commands.