The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
May. 28, 2024

Filed:

Jul. 03, 2020
Applicants:

Elmira Amirloo Abolfathi, North York, CA;

Jun Luo, Toronto, CA;

Peyman Yadmellat, North York, CA;

Inventors:

Elmira Amirloo Abolfathi, North York, CA;

Jun Luo, Toronto, CA;

Peyman Yadmellat, North York, CA;

Assignee:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06N 3/08 (2023.01); G05D 1/00 (2006.01); G06F 18/21 (2023.01); G06F 18/214 (2023.01); G06N 3/047 (2023.01);
U.S. Cl.
CPC ...
G05D 1/0088 (2013.01); G06F 18/214 (2023.01); G06F 18/217 (2023.01); G06N 3/047 (2023.01); G06N 3/08 (2013.01);
Abstract

Methods and systems of training RL agent for autonomous operation of a vehicle are described. The RL agent is trained using uniformly sampled training samples and learning a policy. After the RL agent has achieved a predetermined performance goal, data is collected including a sequence of sampled states, and for each sequence of sampled states, agent parameters, and an indication of failure of the RL agent for the sequence. A failure predictor is trained, using samples from the collected data, to predict a probability of failure of the RL agent for a given sequence of states. Sequences of states are collected by simulating interaction of the vehicle with the environment. Based on a probability of failure outputted by the failure predictor, a sequence of states is selected. The RL agent is further trained based on the selected sequence of states.


Find Patent Forward Citations

Loading…