The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Dec. 30, 2025
Filed:
Nov. 02, 2022
Robert Bosch Gmbh, Stuttgart, DE;
Christoph Kroener, Freiberg am Neckar, DE;
Jared Evans, Sunnyvale, CA (US);
Robert Bosch GmbH, , DE;
Abstract
Methods and systems for smoothening the transition of reward systems or datasets for actor-critic reinforcement learning models. A reinforcement model such as an actor-critic model is trained on a first dataset and a first reward system. The weights of the actor model and the critic model are frozen. While these weights are frozen, an affine transformation layer is attached to a final layer of the critic model, and the affine transformation layer is trained with a second dataset and a second reward system in order to adjust a weight of the final layer of the critic model. Then, the weights of the critic model are unfrozen which allows the adjusted weight of the final layer of the critic model to be implemented. The reinforcement learning model is retrained on the second dataset and second reward system, first with just the critic weights unfrozen, and then with both actor and critic weights unfrozen.