The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e., PDF).

Date of Patent:

Aug. 19, 2025

Filed:

Nov. 16, 2023

Applicant:

Naver Corporation, Gyeonggi-do, KR;

Inventors:

David Emukpere, Auvergne-Rhone-Alpes, FR;

Bingbing Wu, Auvergne-Rhone-Alpes, FR;

Julien Perez, Auvergne-Rhone-Alpes, FR;

Assignee:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G05B 19/4155 (2006.01); B25J 9/16 (2006.01);
U.S. Cl.
CPC ...
G05B 19/4155 (2013.01); B25J 9/163 (2013.01); G05B 2219/39376 (2013.01);
Abstract

Systems and methods are disclosed for determining a policy to recommend transition in a position-representing space for a robotic device using a multi-critic architecture. To learn policy in a multi-critic architecture, a set of critics is defined pertaining to a position-representing space where each critic corresponds to a different objective function such as reach-reward, discovery-reward, and safety-reward. For each one of the critics of the set of critics, a learned value function in position-representing space is determined. The policy is learned based on the weighted feedback of the learned value functions to recommend transitions that are safe in the position-representing space. The multi-critic architecture minimizes interference between multiple reward functions and learns a safe and stable policy for the robotic device.
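The weighted combination of critics described in the abstract can be illustrated with a small tabular sketch. This is not the patented method: the one-dimensional corridor, the reward definitions, the visit counts, the use of value iteration to obtain each critic's value function, and every name below (`step`, `learn_value_function`, `recommend_transition`) are assumptions chosen only to make the idea concrete.

```python
from typing import Callable, Dict, List

# Hypothetical toy setup: positions 0..6 on a line, goal at 6, hazard at 0.
N_POSITIONS = 7
ACTIONS = (-1, +1)                 # candidate transitions: step left or right
GAMMA = 0.9                        # discount factor (assumed)
GOAL = 6
HAZARDS = {0}
VISITS = [0, 5, 5, 5, 5, 5, 5]     # assumed visit counts; position 0 is unexplored


def step(pos: int, action: int) -> int:
    """Apply a transition, clipping at the corridor ends."""
    return min(max(pos + action, 0), N_POSITIONS - 1)


# One reward function per critic, each scoring the position being entered.
CRITIC_REWARDS: Dict[str, Callable[[int], float]] = {
    "reach": lambda nxt: 1.0 if nxt == GOAL else 0.0,
    "discovery": lambda nxt: 1.0 / (1 + VISITS[nxt]),
    "safety": lambda nxt: -1.0 if nxt in HAZARDS else 0.0,
}


def learn_value_function(reward: Callable[[int], float], sweeps: int = 300) -> List[float]:
    """Tabular value iteration for a single critic's objective (a stand-in for learning)."""
    values = [0.0] * N_POSITIONS
    for _ in range(sweeps):
        values = [
            max(reward(step(p, a)) + GAMMA * values[step(p, a)] for a in ACTIONS)
            for p in range(N_POSITIONS)
        ]
    return values


# Each critic keeps its own value function, so the objectives do not share one estimate.
VALUES: Dict[str, List[float]] = {
    name: learn_value_function(reward) for name, reward in CRITIC_REWARDS.items()
}


def recommend_transition(pos: int, weights: Dict[str, float]) -> int:
    """Pick the transition with the highest weighted sum of per-critic one-step returns."""
    def score(action: int) -> float:
        nxt = step(pos, action)
        return sum(
            weights[name] * (CRITIC_REWARDS[name](nxt) + GAMMA * VALUES[name][nxt])
            for name in CRITIC_REWARDS
        )
    return max(ACTIONS, key=score)


if __name__ == "__main__":
    print(recommend_transition(1, {"reach": 1.0, "discovery": 1.0, "safety": 0.0}))
    print(recommend_transition(1, {"reach": 1.0, "discovery": 1.0, "safety": 1.0}))
```

In this toy setup, with the safety weight set to zero the discovery critic pulls the recommendation from position 1 toward the unexplored hazard at position 0; giving the safety critic a weight of 1.0 flips the recommendation toward the goal instead.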

