The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent: Nov. 29, 2022

Filed: Jun. 12, 2020

Applicants:
Borislav Mavrin, York, CA;
Daniel Mark Graves, Edmonton, CA;

Inventors:
Borislav Mavrin, York, CA;
Daniel Mark Graves, Edmonton, CA;

Assignee:
Attorney:
Primary Examiner:
Int. Cl.: B25J 9/16 (2006.01); G06N 3/08 (2006.01)
U.S. Cl. (CPC): B25J 9/161 (2013.01); G06N 3/08 (2013.01)
Abstract

A robot includes an RL (reinforcement learning) agent that is configured to learn a policy that maximizes the cumulative reward of a task and to determine one or more features that are minimally correlated with each other. The features are then used as pseudo-rewards, called feature rewards, where each feature reward corresponds to an option policy, or skill, that the RL agent learns to maximize. In an example, the RL agent is configured to select the most relevant features from which to learn respective option policies. For each of the selected features, the RL agent is configured to learn the respective option policy that maximizes the respective feature reward. Using the learned option policies, the RL agent is configured to learn a new (second) policy for a new (second) task that can choose from any of the learned option policies or actions available to the RL agent.
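
The abstract does not say how the minimally correlated features are determined or selected. As a rough illustration only, the sketch below assumes features are logged as columns of values over time, measures dependence with Pearson correlation, and picks features greedily. The function names (`pearson`, `select_features`, `feature_reward`), the greedy rule, the toy data, and the choice to use the raw feature value as the feature reward are all assumptions made for this example and are not taken from the patent.

```python
import math


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / math.sqrt(vx * vy)


def select_features(columns, k):
    """Greedily pick up to k feature indices with low mutual |correlation|.

    Assumed heuristic: start from feature 0 (an arbitrary choice), then
    repeatedly add the feature whose largest |correlation| with the
    already-selected features is smallest.
    """
    selected = [0]
    while len(selected) < min(k, len(columns)):
        best, best_score = None, float("inf")
        for j in range(len(columns)):
            if j in selected:
                continue
            score = max(abs(pearson(columns[j], columns[i])) for i in selected)
            if score < best_score:
                best, best_score = j, score
        selected.append(best)
    return selected


def feature_reward(columns, j, t):
    """Pseudo-reward at step t for the option tied to feature j.

    Assumption: the reward is simply the feature's value at that step.
    """
    return columns[j][t]


# Hypothetical toy data: three features observed over four time steps.
features = [
    [1.0, 2.0, 3.0, 4.0],    # f0
    [2.0, 4.0, 6.0, 8.0],    # f1: a scaled copy of f0, so fully correlated
    [1.0, -1.0, -1.0, 1.0],  # f2: zero correlation with f0
]
chosen = select_features(features, 2)
print(chosen)
```

In this toy case the redundant feature f1 is skipped in favor of f2. Under the scheme the abstract describes, each selected feature would then define one feature reward, and one option policy would be trained to maximize it; that training step is not shown here.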

