The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jan. 07, 2025

Filed:

May. 28, 2020
Applicant:

Deepmind Technologies Limited, London, GB;

Inventors:

Ziyu Wang, St. Albans, GB;

Nicolas Manfred Otto Heess, London, GB;

Victor Constant Bapst, London, GB;

Assignee:
Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06N 3/045 (2023.01); G06N 3/006 (2023.01); G06N 3/047 (2023.01); G06N 3/084 (2023.01); G06N 3/088 (2023.01);
U.S. Cl.
CPC ...
G06N 3/045 (2023.01); G06N 3/006 (2013.01); G06N 3/047 (2023.01); G06N 3/084 (2013.01); G06N 3/088 (2013.01);
Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training an action selection neural network. One of the methods includes maintaining a replay memory that stores trajectories generated as a result of interaction of an agent with an environment; and training an action selection neural network having policy parameters on the trajectories in the replay memory, wherein training the action selection neural network comprises: sampling a trajectory from the replay memory; and adjusting current values of the policy parameters by training the action selection neural network on the trajectory using an off-policy actor critic reinforcement learning technique.


Find Patent Forward Citations

Loading…