The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Nov. 25, 2025

Filed:

Jul. 11, 2022
Applicant:

Hitachi, Ltd., Tokyo, JP;

Inventors:

Takuya Kanazawa, Santa Clara, CA (US);

Haiyan Wang, Fremont, CA (US);

Chetan Gupta, San Mateo, CA (US);

Assignee:

HITACHI, LTD., Tokyo, JP;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 16/00 (2019.01); G06N 20/00 (2019.01);
U.S. Cl.
CPC ...
G06N 20/00 (2019.01);
Abstract

A method for reinforcement learning (RL) of continuous actions. The method may include receiving a state as input to at least one actor network to predict candidate actions based on the state, wherein the state is a current observation; outputting the candidate actions from the at least one actor network; receiving the state and the candidate actions as inputs to a plurality of distributional critic networks, wherein the plurality of distributional critic networks calculates quantiles of a return distribution associated with the candidate actions in relation to the state; outputting the quantiles from the plurality of distributional critic networks; and selecting an output action based on the candidate actions and the quantiles.


Find Patent Forward Citations

Loading…