The patent badge is an abbreviated version of the USPTO patent document. It covers the following fields: patent number, issue date, filing date, title, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPC classifications, and abstract. The badge also links to the full patent document (in PDF format).

Date of Patent:

May 23, 2023

Filed:

Nov. 16, 2018

Applicant:

Honda Motor Co., Ltd., Tokyo, JP;

Inventors:

Jiachen Yang, San Jose, CA (US);

Alireza Nakhaei Sarvedani, Sunnyvale, CA (US);

David Francis Isele, Sunnyvale, CA (US);

Kikuo Fujimura, Palo Alto, CA (US);

Assignee:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06N 3/08 (2023.01); G05D 1/00 (2006.01); H04W 4/44 (2018.01); G06N 3/045 (2023.01); G06N 3/047 (2023.01);
U.S. Cl.
CPC ...
G06N 3/08 (2013.01); G05D 1/0088 (2013.01); G06N 3/045 (2023.01); G06N 3/047 (2023.01); H04W 4/44 (2018.02);
Abstract

According to one aspect, cooperative multi-goal, multi-agent, multi-stage (CM3) reinforcement learning may include training a first agent using a first policy gradient and a first critic using a first loss function to learn goals in a single-agent environment using a Markov decision process, training a number of agents based on the first policy gradient and a second policy gradient and a second critic based on the first loss function and a second loss function to learn cooperation between the agents in a multi-agent environment using a Markov game to instantiate a second agent neural network, each of the agents instantiated with the first agent neural network in a pre-trained fashion, and generating a CM3 network policy based on the first agent neural network and the second agent neural network. The CM3 network policy may be implemented in a CM3 based autonomous vehicle to facilitate autonomous driving.
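
The two-stage structure described in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation. The class names (`Stage1Policy`, `Stage2Policy`), the toy goal-matching task, the softmax-linear policy, the zero initialization of the new weights, and the constant 0.5 baseline standing in for a learned critic are all assumptions made for illustration. The multi-agent (Markov game) training of the second stage is omitted; the sketch only shows how agents could start from the pre-trained first network.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def linear(weights, inputs):
    """One logit per action: each row of `weights` dotted with `inputs`."""
    return [sum(w * x for w, x in zip(row, inputs)) for row in weights]


class Stage1Policy:
    """Hypothetical first agent network: sees only its own one-hot goal."""

    def __init__(self, n_actions, n_self, rng):
        self.w_self = [[rng.uniform(-0.1, 0.1) for _ in range(n_self)]
                       for _ in range(n_actions)]

    def action_probs(self, obs_self):
        return softmax(linear(self.w_self, obs_self))

    def policy_gradient_step(self, obs_self, action, advantage, lr=0.5):
        # For a softmax-linear policy, d log pi(a|s) / d w[k][j]
        # equals (1[k == a] - pi(k|s)) * s[j].
        probs = self.action_probs(obs_self)
        for k, row in enumerate(self.w_self):
            coeff = (1.0 if k == action else 0.0) - probs[k]
            for j, x in enumerate(obs_self):
                row[j] += lr * advantage * coeff * x


class Stage2Policy:
    """Hypothetical combined policy: pre-trained weights plus new weights
    that read observations of other agents."""

    def __init__(self, stage1, n_others):
        # Copy the pre-trained single-agent weights.
        self.w_self = [row[:] for row in stage1.w_self]
        # Assumption: new weights start at zero, so the agent initially
        # behaves exactly like the stage-1 policy.
        self.w_others = [[0.0] * n_others for _ in stage1.w_self]

    def action_probs(self, obs_self, obs_others):
        own = linear(self.w_self, obs_self)
        social = linear(self.w_others, obs_others)
        return softmax([a + b for a, b in zip(own, social)])


def train_stage1(n_goals=2, steps=200, seed=0):
    """Toy single-agent stage: reward 1 when the action matches the goal."""
    rng = random.Random(seed)
    policy = Stage1Policy(n_actions=n_goals, n_self=n_goals, rng=rng)
    for _ in range(steps):
        for goal in range(n_goals):
            obs = [1.0 if i == goal else 0.0 for i in range(n_goals)]
            probs = policy.action_probs(obs)
            action = rng.choices(range(n_goals), weights=probs)[0]
            reward = 1.0 if action == goal else 0.0
            # Constant baseline of 0.5 stands in for a learned critic.
            policy.policy_gradient_step(obs, action, reward - 0.5)
    return policy


stage1 = train_stage1()
# Every agent in the multi-agent stage starts from the same stage-1 weights.
agents = [Stage2Policy(stage1, n_others=3) for _ in range(2)]
```

In this sketch, each entry of `agents` initially returns the same action probabilities as `stage1` for any observation of the other agents, because the added weights are zero. Training those added weights in a multi-agent environment is the step the abstract describes as learning cooperation.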

