The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The badge also links to the full patent document (in Adobe Acrobat format, i.e. PDF).
Patent No.:
Date of Patent:
Nov. 08, 2022
Filed:
May 15, 2019
Applicant:
Baidu USA LLC, Sunnyvale, CA (US);
Inventors:
Runxin He, Sunnyvale, CA (US);
Jinyun Zhou, Sunnyvale, CA (US);
Qi Luo, Sunnyvale, CA (US);
Shiyu Song, Sunnyvale, CA (US);
Jinghao Miao, Sunnyvale, CA (US);
Jiangtao Hu, Sunnyvale, CA (US);
Yu Wang, Sunnyvale, CA (US);
Jiaxuan Xu, Sunnyvale, CA (US);
Shu Jiang, Sunnyvale, CA (US);
Assignee:
BAIDU USA LLC, Sunnyvale, CA (US);
Abstract
In one embodiment, a system generates a plurality of driving scenarios to train a reinforcement learning (RL) agent and replays each of the driving scenarios to train the RL agent by: applying an RL algorithm to an initial state of a driving scenario to determine a number of control actions, chosen from a number of discretized control/action options, for the autonomous driving vehicle (ADV) to advance to a number of trajectory states, which are based on a number of discretized trajectory state options; determining a reward prediction by the RL algorithm for each of the controls/actions; determining a judgment score for the trajectory states; and updating the RL agent based on the judgment score.
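The training loop described in the abstract can be illustrated with a toy sketch. This is not the patented method: it assumes a one-dimensional state (a speed level), three control options, a dictionary of reward predictions, and a simple Monte Carlo-style update that pulls each prediction toward the judgment score. All names here (`ACTIONS`, `NUM_STATES`, `generate_scenarios`, `judgment_score`, `replay_scenario`) and all parameter values are hypothetical and chosen only for illustration.

```python
import random

# Hypothetical discretized control/action options: brake, hold, accelerate.
ACTIONS = (-1, 0, 1)
# Hypothetical discretized trajectory state options: speed levels 0..4.
NUM_STATES = 5


def generate_scenarios(count, rng):
    """Generate toy driving scenarios as (initial_state, target_state) pairs."""
    return [(rng.randrange(NUM_STATES), rng.randrange(NUM_STATES))
            for _ in range(count)]


def judgment_score(states, target):
    """Score a trajectory: 0 is best, more negative means farther from target."""
    return -sum(abs(s - target) for s in states) / len(states)


def replay_scenario(q_table, scenario, rng, steps=4, epsilon=0.2, lr=0.5):
    """Replay one scenario and update the reward predictions in place."""
    state, target = scenario
    visited, trajectory = [], []
    for _ in range(steps):
        # Reward prediction for each discretized control/action option.
        predictions = {a: q_table.get((state, target, a), 0.0) for a in ACTIONS}
        if rng.random() < epsilon:
            action = rng.choice(ACTIONS)  # occasional random exploration
        else:
            action = max(predictions, key=predictions.get)
        visited.append((state, target, action))
        # Advance to the next discretized state, clamped to the valid range.
        state = min(max(state + action, 0), NUM_STATES - 1)
        trajectory.append(state)
    score = judgment_score(trajectory, target)
    # Assumed update rule: move each visited prediction toward the score.
    for key in visited:
        old = q_table.get(key, 0.0)
        q_table[key] = old + lr * (score - old)
    return score


if __name__ == "__main__":
    rng = random.Random(0)
    q_table = {}
    for scenario in generate_scenarios(20, rng):
        replay_scenario(q_table, scenario, rng)
```

Because unvisited entries default to 0.0, the best possible score, untried actions look optimistic and tend to get explored. A real system would presumably use a far richer state and action space and a learned function approximator rather than a lookup table.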