The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Aug. 22, 2023

Filed:

Jul. 30, 2018
Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Don Joven Ravoy Agravante, Tokyo, JP;

Giovanni De De Magistris, Tokyo, JP;

Tu-Hoa Pham, Tokyo, JP;

Ryuki Tachibana, Yokohama, JP;

Attorneys:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06N 3/088 (2023.01); G06N 3/045 (2023.01); G06N 3/08 (2023.01);
U.S. Cl.
CPC ...
G06N 3/088 (2013.01); G06N 3/045 (2023.01); G06N 3/08 (2013.01);
Abstract

A computer-implemented method, computer program product, and computer processing system are provided for Hierarchical Reinforcement Learning (HRL) with a target task. The method includes obtaining, by a processor device, a sequence of tasks based on hierarchical relations between the tasks, the tasks constituting the target task. The method further includes learning, by a processor device, a sequence of constraints corresponding to the sequence of tasks by repeating, for each of the tasks in the sequence, reinforcement learning and supervised learning with a set of good samples and a set of bad samples and by applying an obtained constraint for a current task to a next task.


Find Patent Forward Citations

Loading…