The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Mar. 14, 2023

Filed:

Jun. 11, 2018
Applicant:

Deepmind Technologies Limited, London, GB;

Inventors:

Marc Gendron-Bellemare, London, GB;

Mohammad Gheshlaghi Azar, London, GB;

Audrunas Gruslys, London, GB;

Remi Munos, London, GB;

Assignee:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06N 3/08 (2006.01); G06N 3/04 (2006.01); G06N 3/084 (2023.01);
U.S. Cl.
CPC ...
G06N 3/084 (2013.01); G06N 3/0454 (2013.01);
Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a policy neural network. The policy neural network is used to select actions to be performed by an agent that interacts with an environment by receiving an observation characterizing a state of the environment and performing an action from a set of actions in response to the received observation. A trajectory is obtained from a replay memory, and a final update to current values of the policy network parameters is determined for each training observation in the trajectory. The final updates to the current values of the policy network parameters are determined from selected action updates and leave-one-out updates.


Find Patent Forward Citations

Loading…