The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:
Sep. 10, 2024

Filed:
Feb. 24, 2021

Applicant:
Kabushiki Kaisha Toshiba, Tokyo, JP;

Inventors:
Steven Morad, Cambridge, GB;
Roberto Mecca, Cambridge, GB;
Rudra Poudel, Cambridge, GB;
Stephan Liwicki, Cambridge, GB;
Roberto Cipolla, Cambridge, GB;

Assignee:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G05D 1/02 (2020.01); G05D 1/00 (2006.01); G06F 18/214 (2023.01); G06N 20/00 (2019.01); G06V 20/56 (2022.01);
U.S. Cl.
CPC ...
G05D 1/0221 (2013.01); G05D 1/0088 (2013.01); G05D 1/0214 (2013.01); G06F 18/214 (2023.01); G06N 20/00 (2019.01); G06V 20/56 (2022.01);
Abstract

A computer-implemented method for training an agent in a first context including an entity and an environment of the entity, to allow an apparatus to perform a navigation task in a second context comprising the apparatus and a physical environment of the apparatus, the apparatus adapted to receive images of the physical environment of the apparatus and comprising a steering device adapted to control the direction of the apparatus, the method comprising: obtaining one or more navigation tasks comprising: generating a navigation task; scoring the navigation task using a machine-learned model trained to estimate the easiness of tasks; in response to the score satisfying a selection criterion, selecting the navigation task as one of the one or more navigation tasks; and training the agent using a reinforcement learning method comprising attempting to perform, by the entity, the one or more navigation tasks using images of the environment of the entity.
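The task-selection loop described in the abstract (generate a task, score it with an easiness model, keep it if the score satisfies a selection criterion) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented implementation: the grid-based `NavigationTask`, the distance-based `easiness_score` (a stand-in for the machine-learned model), the `in_band` criterion with its thresholds, and the `max_attempts` cap are all hypothetical names and choices that do not appear in the patent text. The reinforcement learning step that would consume the selected tasks is left out.

```python
import random
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass(frozen=True)
class NavigationTask:
    """Hypothetical task: travel from `start` to `goal` on a square grid."""
    start: Tuple[int, int]
    goal: Tuple[int, int]


def generate_task(rng: random.Random, size: int = 10) -> NavigationTask:
    """Generate a random candidate task (uniform sampling is an assumption)."""
    start = (rng.randrange(size), rng.randrange(size))
    goal = (rng.randrange(size), rng.randrange(size))
    return NavigationTask(start, goal)


def easiness_score(task: NavigationTask) -> float:
    """Stand-in for the machine-learned easiness model.

    Assumption: a shorter Manhattan distance means an easier task.
    A real system would query a trained model here instead.
    """
    distance = abs(task.start[0] - task.goal[0]) + abs(task.start[1] - task.goal[1])
    return 1.0 / (1.0 + distance)


def in_band(score: float, low: float = 0.1, high: float = 0.5) -> bool:
    """Assumed selection criterion: keep tasks of intermediate easiness."""
    return low <= score <= high


def select_tasks(
    generate: Callable[[], NavigationTask],
    score: Callable[[NavigationTask], float],
    criterion: Callable[[float], bool],
    num_tasks: int,
    max_attempts: int = 10_000,
) -> List[NavigationTask]:
    """Generate, score, and filter tasks until `num_tasks` are selected.

    Stops early after `max_attempts` candidates so the loop always terminates,
    even when the criterion rejects everything.
    """
    selected: List[NavigationTask] = []
    for _ in range(max_attempts):
        if len(selected) >= num_tasks:
            break
        task = generate()
        if criterion(score(task)):
            selected.append(task)
    return selected


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so runs are repeatable
    tasks = select_tasks(lambda: generate_task(rng), easiness_score, in_band, num_tasks=5)
    for task in tasks:
        print(task, round(easiness_score(task), 3))
```

Passing the generator, scorer, and criterion as callables keeps the selection loop independent of any particular model, so the placeholder scorer could be swapped for a trained one without changing `select_tasks`.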

