The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Sep. 24, 2024
Filed:
Sep. 08, 2020
Robert Bosch Gmbh, Stuttgart, DE;
Koninklijke Philips N.v., Eindhoven, NL;
Michael Herman, Sindelfingen, DE;
Max Welling, Bussum, NL;
Herke Van Hoof, Diemen, NL;
Elise Van Der Pol, Amsterdam, NL;
Daniel Worrall, Eindhoven, NL;
Frans Adriaan Oliehoek, Delft, NL;
Robert Bosch GMBH, Stuttgart, DE;
Abstract
Some embodiments are directed to a computer-implemented method of interacting with a physical environment according to a policy. The policy determines multiple action probabilities of respective actions based on an observable state of the physical environment. The policy includes a neural network parameterized by a set of parameters. The neural network determines the action probabilities by determining a final layer input from an observable state and applying a final layer of the neural network to the final layer input. The final layer is applied by applying a linear combination of a set of equivariant base weight matrices to the final layer input. The base weight matrices are equivariant in the sense that, for a set of multiple predefined transformations of the final layer input, each transformation causes a corresponding predefined action permutation of the base weight matrix output for the final layer input.