The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Nov. 11, 2025

Filed:

Jan. 07, 2022
Applicant:

Toyota Research Institute, Inc., Los Altos, CA (US);

Inventors:

Blake Warren Wulfe, San Francisco, CA (US);

Rowan Mcallister, Los Altos, CA (US);

Adrien David Gaidon, Mountain View, CA (US);

Assignee:

Toyota Research Institute, Inc., Los Altos, CA (US);

Attorneys:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06N 3/08 (2023.01);
U.S. Cl.
CPC ...
G06N 3/08 (2013.01);
Abstract

Systems and methods described herein relate to dynamics-aware comparison of reward functions. One embodiment generates a reference reward function; computes a dynamics-aware transformation of the reference reward function based on a transition model of an environment of a robot; computes a dynamics-aware transformation of a first candidate reward function based on the transition model; computes a dynamics-aware transformation of a second candidate reward function based on the transition model; selects, as a final reward function, the first or second candidate reward function based on which is closer to the reference reward function as measured by pseudometrics computed between their respective dynamics-aware transformations and the dynamics-aware transformation of the reference reward function; and optimizes the final reward function to control, at least in part, operation of the robot.


Find Patent Forward Citations

Loading…