The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e., PDF).

Date of Patent:

Jan. 31, 2023

Filed:

Jan. 24, 2019

Applicant:

The Research Foundation for the State University of New York, Binghamton, NY (US);

Inventors:

Lei Yu, Vestal, NY (US);

Andrew Cohen, Binghamton, NY (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G06N 20/00 (2019.01); G06N 3/08 (2006.01); G05B 13/04 (2006.01); G05B 13/02 (2006.01); G06N 7/00 (2006.01);
U.S. Cl.
CPC ...
G06N 3/08 (2013.01); G05B 13/0265 (2013.01); G05B 13/048 (2013.01); G06N 7/005 (2013.01); G06N 20/00 (2019.01);
Abstract

The present technology addresses the problem of quickly and safely improving policies in online reinforcement learning domains. As its solution, an exploration strategy comprising diverse exploration (DE) is employed, which learns and deploys a diverse set of safe policies to explore the environment. DE theory explains why diversity in behavior policies enables effective exploration without sacrificing exploitation. An empirical study shows that an online policy improvement algorithm framework implementing the DE strategy can achieve both fast policy improvement and safe online performance.
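The idea in the abstract can be illustrated with a toy sketch. The code below is not the patented method. It is a minimal illustration on a three-armed bandit, assuming a simple safety test (an importance-sampling value estimate minus a normal-approximation margin must not fall below the baseline's observed average reward) and a greedy L1-distance rule for diversity. Every function name, the arm reward probabilities, and all constants are hypothetical choices made for this example.

```python
import random

def sample_arm(policy, rng):
    """Draw one arm index according to the policy's probabilities."""
    return rng.choices(range(len(policy)), weights=policy, k=1)[0]

def collect(policies, arm_means, rng, steps_per_policy=200):
    """Deploy each policy in turn; log (arm, reward, behavior probability)."""
    data = []
    for policy in policies:
        for _ in range(steps_per_policy):
            arm = sample_arm(policy, rng)
            reward = 1.0 if rng.random() < arm_means[arm] else 0.0
            data.append((arm, reward, policy[arm]))
    return data

def lower_bound(candidate, data, z=1.645):
    """Importance-sampling value estimate minus z standard errors.

    Assumption: a normal-approximation bound stands in for whatever
    safety test a real system would use.
    """
    terms = [candidate[a] / b * r for a, r, b in data]
    n = len(terms)
    mean = sum(terms) / n
    var = sum((t - mean) ** 2 for t in terms) / (n - 1)
    return mean - z * (var / n) ** 0.5

def perturb(policy, rng, scale=0.15):
    """Random candidate near the given policy, renormalized to sum to 1."""
    raw = [max(1e-3, p + rng.uniform(-scale, scale)) for p in policy]
    total = sum(raw)
    return [x / total for x in raw]

def distance(p, q):
    """L1 distance between two action distributions."""
    return sum(abs(a - b) for a, b in zip(p, q))

def diverse_safe_set(baseline, data, threshold, rng, k=3, n_candidates=100):
    """Keep candidates that pass the safety test, then greedily pick the
    ones farthest (in L1 distance) from the policies already chosen."""
    safe = [c for c in (perturb(baseline, rng) for _ in range(n_candidates))
            if lower_bound(c, data) >= threshold]
    chosen = [baseline]  # the baseline is assumed safe by definition
    while safe and len(chosen) < k:
        best = max(safe, key=lambda c: min(distance(c, s) for s in chosen))
        chosen.append(best)
        safe.remove(best)
    return chosen

rng = random.Random(0)
arm_means = [0.2, 0.5, 0.8]          # hypothetical; hidden from the learner
baseline = [1 / 3, 1 / 3, 1 / 3]     # hypothetical starting policy
data = collect([baseline], arm_means, rng)
threshold = sum(r for _, r, _ in data) / len(data)  # baseline's observed value
policies = diverse_safe_set(baseline, data, threshold, rng)
more_data = collect(policies, arm_means, rng)       # explore with the whole set
```

In this sketch the diverse policies are deployed in turn, so the logged data covers more of the action space than the baseline alone would, while each deployed policy has passed the safety test first.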

