The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in PDF format).

Date of Patent:

Sep. 24, 2024

Filed:

Sep. 08, 2020

Applicants:

Robert Bosch GmbH, Stuttgart, DE;

Koninklijke Philips N.V., Eindhoven, NL;

Inventors:

Michael Herman, Sindelfingen, DE;

Max Welling, Bussum, NL;

Herke Van Hoof, Diemen, NL;

Elise Van Der Pol, Amsterdam, NL;

Daniel Worrall, Eindhoven, NL;

Frans Adriaan Oliehoek, Delft, NL;

Assignee:

Robert Bosch GmbH, Stuttgart, DE;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06V 10/82 (2022.01); G06F 18/21 (2023.01); G06N 3/04 (2023.01); G06N 3/063 (2023.01); G06N 3/08 (2023.01);
U.S. Cl.
CPC ...
G06V 10/82 (2022.01); G06F 18/217 (2023.01); G06N 3/04 (2013.01); G06N 3/063 (2013.01); G06N 3/08 (2013.01);
Abstract

Some embodiments are directed to a computer-implemented method of interacting with a physical environment according to a policy. The policy determines multiple action probabilities of respective actions based on an observable state of the physical environment. The policy includes a neural network parameterized by a set of parameters. The neural network determines the action probabilities by determining a final layer input from an observable state and applying a final layer of the neural network to the final layer input. The final layer is applied by applying a linear combination of a set of equivariant base weight matrices to the final layer input. The base weight matrices are equivariant in the sense that, for a set of multiple predefined transformations of the final layer input, each transformation causes a corresponding predefined action permutation of the base weight matrix output for the final layer input.
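The equivariance property described in the abstract can be illustrated with a minimal sketch. It is not taken from the patent: the 4-dimensional final layer input, the two actions, the "flip" transformation (negate the input, swap the two actions), and all function and variable names below are assumptions chosen purely for illustration. The base weight matrices are hand-built so that negating the input swaps the two outputs, and the final layer is a linear combination of them with hypothetical learned coefficients.

```python
import math

# Assumed toy symmetry group: the identity and a "flip" that negates the
# final layer input. Each transformation is paired with the action
# permutation it should cause (the flip swaps action 0 and action 1).
INPUT_TRANSFORMS = [
    lambda x: list(x),           # identity
    lambda x: [-v for v in x],   # flip
]
ACTION_PERMUTATIONS = [
    [0, 1],  # identity leaves the actions in place
    [1, 0],  # flip swaps the two actions
]


def make_base_matrices(n_inputs):
    """Build one 2 x n_inputs base matrix per input coordinate.

    Row 1 is the negation of row 0, so negating the input swaps the two
    outputs. Any linear combination of these matrices keeps that property.
    """
    bases = []
    for j in range(n_inputs):
        top = [1.0 if k == j else 0.0 for k in range(n_inputs)]
        bottom = [-v for v in top]
        bases.append([top, bottom])
    return bases


def combine(bases, coefficients):
    """Linear combination of the base matrices (the final layer weights)."""
    rows, cols = len(bases[0]), len(bases[0][0])
    return [
        [sum(c * b[r][k] for c, b in zip(coefficients, bases)) for k in range(cols)]
        for r in range(rows)
    ]


def matvec(weights, x):
    return [sum(w * v for w, v in zip(row, x)) for row in weights]


def softmax(logits):
    m = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def action_probabilities(coefficients, final_layer_input):
    """Apply the equivariant final layer, then normalize to probabilities."""
    bases = make_base_matrices(len(final_layer_input))
    weights = combine(bases, coefficients)
    return softmax(matvec(weights, final_layer_input))
```

In this sketch, calling `action_probabilities` on a flipped input yields the same two probabilities in swapped order, for any choice of coefficients. That is the sense in which each predefined input transformation causes a corresponding action permutation of the output.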

