The patent badge is an abbreviated version of the USPTO patent document. It lists the patent number, issue date, filing date, title, applicant, inventors, assignee, attorney firm, primary examiner, assistant examiner, CPC classifications, and abstract, and it links to the full patent document in PDF (Adobe Acrobat) format.

Date of Patent: Dec. 27, 2022

Filed: Mar. 25, 2020

Applicant: Deepmind Technologies Limited, London, GB

Inventors:
Razvan Pascanu, Letchworth Garden City, GB
Raia Thais Hadsell, London, GB
Mel Vecerik, London, GB
Thomas Rothoerl, London, GB
Andrei-Alexandru Rusu, London, GB
Nicolas Manfred Otto Heess, London, GB

Assignee:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06N 3/08 (2006.01); B25J 9/16 (2006.01); G06N 3/04 (2006.01); G05B 13/02 (2006.01); G06N 3/00 (2006.01);
U.S. Cl.
CPC ...
B25J 9/163 (2013.01); B25J 9/1671 (2013.01); G05B 13/027 (2013.01); G06N 3/008 (2013.01); G06N 3/0445 (2013.01); G06N 3/0454 (2013.01); G06N 3/08 (2013.01);
Abstract

A system includes a neural network system implemented by one or more computers. The neural network system is configured to receive an observation characterizing a current state of a real-world environment being interacted with by a robotic agent to perform a robotic task and to process the observation to generate a policy output that defines an action to be performed by the robotic agent in response to the observation. The neural network system includes: (i) a sequence of deep neural networks (DNNs), in which the sequence of DNNs includes a simulation-trained DNN that has been trained on interactions of a simulated version of the robotic agent with a simulated version of the real-world environment to perform a simulated version of the robotic task, and (ii) a first robot-trained DNN that is configured to receive the observation and to process the observation to generate the policy output.
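The arrangement described in the abstract can be illustrated with a small sketch. This is not the patented implementation: the layer sizes, the random weights, the softmax policy over discrete actions, and in particular the lateral connection that feeds the simulation-trained features into the robot-trained network are all assumptions made for illustration, and every class and function name below is hypothetical. It uses only the Python standard library.

```python
import math
import random

# Hypothetical sizes chosen only for illustration.
OBS_DIM, HIDDEN_DIM, NUM_ACTIONS = 4, 6, 3


def dense(rng, in_dim, out_dim):
    """Return a random weight matrix (in_dim x out_dim) for one dense layer."""
    return [[rng.gauss(0.0, 0.1) for _ in range(out_dim)] for _ in range(in_dim)]


def matvec(x, w):
    """Multiply the row vector x by the matrix w."""
    return [sum(x[i] * w[i][j] for i in range(len(x))) for j in range(len(w[0]))]


class SimTrainedDNN:
    """Stand-in for the simulation-trained DNN; its weights are treated as fixed."""

    def __init__(self, rng):
        self.w = dense(rng, OBS_DIM, HIDDEN_DIM)

    def hidden(self, obs):
        return [math.tanh(v) for v in matvec(obs, self.w)]


class RobotTrainedDNN:
    """Stand-in for the first robot-trained DNN, which produces the policy output."""

    def __init__(self, rng):
        self.w_obs = dense(rng, OBS_DIM, HIDDEN_DIM)
        # Assumption: a lateral connection from the simulation-trained features.
        self.w_lat = dense(rng, HIDDEN_DIM, HIDDEN_DIM)
        self.w_out = dense(rng, HIDDEN_DIM, NUM_ACTIONS)

    def policy(self, obs, sim_hidden):
        from_obs = matvec(obs, self.w_obs)
        from_sim = matvec(sim_hidden, self.w_lat)
        h = [math.tanh(a + b) for a, b in zip(from_obs, from_sim)]
        logits = matvec(h, self.w_out)
        # Numerically stable softmax over the (assumed) discrete actions.
        peak = max(logits)
        exps = [math.exp(v - peak) for v in logits]
        total = sum(exps)
        return [e / total for e in exps]


def select_action(obs, sim_dnn, robot_dnn):
    """Process one observation and return the greedy action and its distribution."""
    probs = robot_dnn.policy(obs, sim_dnn.hidden(obs))
    return max(range(len(probs)), key=probs.__getitem__), probs


rng = random.Random(0)
sim_dnn, robot_dnn = SimTrainedDNN(rng), RobotTrainedDNN(rng)
action, probs = select_action([0.1, -0.2, 0.3, 0.0], sim_dnn, robot_dnn)
```

Here the observation passes through both networks, and only the robot-trained network emits the policy output, mirroring items (i) and (ii) of the abstract. Training is omitted entirely; in a real system the simulation-trained weights would come from simulated interaction and the robot-trained weights from real-world interaction.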

