The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Sep. 16, 2025

Filed:

Aug. 23, 2023
Applicant:

Jiangsu University, Zhenjiang, CN;

Inventors:

Yingfeng Cai, Zhenjiang, CN;

Sikai Lu, Zhenjiang, CN;

Hai Wang, Zhenjiang, CN;

Yubo Lian, Zhenjiang, CN;

Long Chen, Zhenjiang, CN;

Qingchao Liu, Zhenjiang, CN;

Assignee:

JIANGSU UNIVERSITY, Zhenjiang, CN;

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
B60W 50/08 (2020.01); B60W 60/00 (2020.01); G05B 13/02 (2006.01); G06N 3/092 (2023.01); G06N 3/098 (2023.01);
U.S. Cl.
CPC ...
B60W 50/08 (2013.01); B60W 60/001 (2020.02); G05B 13/027 (2013.01); G06N 3/092 (2023.01); G06N 3/098 (2023.01); B60W 2510/188 (2013.01); B60W 2520/10 (2013.01); B60W 2520/12 (2013.01);
Abstract

The provided are a federated reinforcement learning (FRL) end-to-end autonomous driving control system and method, as well as vehicular equipment, based on complex network cognition. An FRL algorithm framework is provided, designated as FLDPPO, for dense urban traffic. This framework combines rule-based complex network cognition with end-to-end FRL through the design of a loss function. FLDPPO employs a dynamic driving guidance system to assist agents in learning rules, thereby enabling them to navigate complex urban driving environments and dense traffic scenarios. Moreover, the provided framework utilizes a multi-agent FRL architecture, whereby models are trained through parameter aggregation to safeguard vehicle-side privacy, accelerate network convergence, reduce communication consumption, and achieve a balance between sampling efficiency and high robustness of the model.


Find Patent Forward Citations

Loading…