The patent badge is an abbreviated version of the USPTO patent document. It lists the patent number, date the patent was issued, date the patent was filed, title, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The badge also links to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:

Sep. 29, 2020

Filed:

Oct. 14, 2019

Applicant:

Deepmind Technologies Limited, London, GB;

Inventors:

Gregory Duncan Wayne, London, GB;

Timothy Paul Lillicrap, London, GB;

Chia-Chun Hung, London, GB;

Joshua Simon Abramson, London, GB;

Assignee:
Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06K 9/62 (2006.01); G06F 11/30 (2006.01); G06N 3/08 (2006.01);
U.S. Cl.
CPC ...
G06K 9/6265 (2013.01); G06F 11/3037 (2013.01); G06F 11/3072 (2013.01); G06N 3/08 (2013.01);
Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network system used to control an agent interacting with an environment to perform a specified task. One of the methods includes causing the agent to perform a task episode in which the agent attempts to perform the specified task; for each of one or more particular time steps in the sequence: generating a modified reward for the particular time step from (i) the actual reward at the time step and (ii) value predictions at one or more time steps that are more than a threshold number of time steps after the particular time step in the sequence; and training, through reinforcement learning, the neural network system using at least the modified rewards for the particular time steps.
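
The reward-modification step described in the abstract can be illustrated with a minimal Python sketch. The abstract does not say how the actual reward and the later value predictions are combined, so everything specific here is an assumption made for illustration: the function name `modified_rewards`, the `scale` factor, and the choice to add the scaled mean of all value predictions that lie more than `threshold` steps after the particular time step. It is not the claimed method.

```python
from typing import List, Optional, Sequence


def modified_rewards(
    rewards: Sequence[float],
    values: Sequence[float],
    threshold: int,
    scale: float = 0.5,
    particular_steps: Optional[Sequence[int]] = None,
) -> List[float]:
    """Return per-step rewards, modified at the chosen time steps.

    Assumed combination rule (hypothetical, not taken from the patent):
    modified[t] = rewards[t] + scale * mean(values[t'] for t' > t + threshold).
    """
    if len(rewards) != len(values):
        raise ValueError("rewards and values must have the same length")
    if threshold < 0:
        raise ValueError("threshold must be non-negative")

    n = len(rewards)
    # By default, treat every time step in the episode as a "particular" step.
    chosen = set(range(n)) if particular_steps is None else set(particular_steps)

    out: List[float] = []
    for t in range(n):
        if t not in chosen:
            # Steps that were not selected keep their actual reward.
            out.append(float(rewards[t]))
            continue
        # Value predictions strictly more than `threshold` steps after t.
        future = values[t + threshold + 1:]
        bonus = scale * sum(future) / len(future) if len(future) > 0 else 0.0
        out.append(float(rewards[t]) + bonus)
    return out


if __name__ == "__main__":
    episode_rewards = [0.0, 0.0, 0.0, 1.0]
    value_predictions = [0.5, 0.5, 1.0, 2.0]
    print(modified_rewards(episode_rewards, value_predictions, threshold=1))
```

The modified rewards would then replace the actual rewards at those time steps when computing the reinforcement learning update; steps with no sufficiently distant future prediction keep their actual reward unchanged.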

