The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Aug. 01, 2023

Filed:

Jul. 25, 2022
Applicant:

Deepmind Technologies Limited, London, GB;

Inventors:

Leonard Hasenclever, London, GB;

Vu Pham, London, GB;

Joshua Merel, Chicago, IL (US);

Alexandre Galashov, London, GB;

Assignee:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06N 3/04 (2006.01); G06N 5/00 (2006.01); G06N 20/00 (2019.01); G06F 17/18 (2006.01); G06N 3/047 (2023.01); G06N 3/045 (2023.01);
U.S. Cl.
CPC ...
G06N 3/047 (2023.01); G06F 17/18 (2013.01); G06N 3/045 (2023.01); G06N 5/00 (2013.01); G06N 20/00 (2019.01);
Abstract

A computer-implemented method of training a student machine learning system comprises receiving data indicating execution of an expert, determining one or more actions performed by the expert during the execution and a corresponding state-action Jacobian, and training the student machine learning system using a linear-feedback-stabilized policy. The linear-feedback-stabilized policy may be based on the state-action Jacobian. Also a neural network system for representing a space of probabilistic motor primitives, implemented by one or more computers. The neural network system comprises an encoder configured to generate latent variables based on a plurality of inputs, each input comprising a plurality of frames, and a decoder configured to generate an action based on one or more of the latent variables and a state.


Find Patent Forward Citations

Loading…