The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also links to the full patent document (in PDF format).

Date of Patent: Mar. 02, 2021
Filed: Oct. 04, 2017
Applicants:
Hengshuai Yao, Markham, CA;
Hao Chen, Ottawa, CA;
Seyed Masoud Nosrati, Markham, CA;
Peyman Yadmellat, North York, CA;
Yunfei Zhang, Aurora, CA;

Inventors:
Hengshuai Yao, Markham, CA;
Hao Chen, Ottawa, CA;
Seyed Masoud Nosrati, Markham, CA;
Peyman Yadmellat, North York, CA;
Yunfei Zhang, Aurora, CA;

Assignee:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G05D 1/02 (2020.01); G06N 3/04 (2006.01); G06N 3/00 (2006.01); B60W 40/12 (2012.01); G06N 3/08 (2006.01); G06N 3/02 (2006.01);
U.S. Cl.
CPC ...
G05D 1/0221 (2013.01); B60W 40/12 (2013.01); G05D 1/0246 (2013.01); G06N 3/006 (2013.01); G06N 3/02 (2013.01); G06N 3/04 (2013.01); G06N 3/0454 (2013.01); G06N 3/08 (2013.01); G06N 3/084 (2013.01); G05D 2201/0213 (2013.01); G06N 3/0481 (2013.01);
Abstract

A method, device and system of prediction of a state of an object in the environment using a pre-trained action model defined by an action model neural network. A control system for an object comprises a plurality of sensors for sensing a current state and an environment in which the object is located, and a first neural network. Predicted subsequent states of the object in the environment are obtained using the action model and a current state of the object in the environment. The action model maps a plurality of state-action pairs (s, a), each state-action pair encoding a state (s) of the object in the environment and an action (a) performed by the object, to a predicted subsequent state (s') of the object in the environment. An action that maximizes a value of a target, based at least on a reward for each of the predicted subsequent states, is determined. The determined action is caused to be performed.
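
The selection step described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation. The names `select_action`, `toy_action_model`, and `toy_reward` are hypothetical, the action model is a hand-written function standing in for the pre-trained neural network, and the target value is assumed to be simply the reward of the predicted subsequent state.

```python
from typing import Callable, Sequence, Tuple

State = Tuple[float, ...]
Action = float


def select_action(
    state: State,
    actions: Sequence[Action],
    action_model: Callable[[State, Action], State],
    reward: Callable[[State], float],
) -> Tuple[Action, State]:
    """Pick the action whose predicted subsequent state s' scores highest.

    Assumption: the target value is just reward(s'); the abstract only says
    the target is based "at least" on that reward.
    """
    if not actions:
        raise ValueError("actions must not be empty")
    # Predict s' for every candidate state-action pair (s, a).
    predictions = [(a, action_model(state, a)) for a in actions]
    # Keep the pair whose predicted state maximizes the target value.
    return max(predictions, key=lambda pair: reward(pair[1]))


def toy_action_model(state: State, action: Action) -> State:
    """Hypothetical stand-in for the trained network: 1-D position update."""
    (position,) = state
    return (position + action,)


def toy_reward(state: State, goal: float = 5.0) -> float:
    """Hypothetical reward: negative distance to an assumed goal position."""
    (position,) = state
    return -abs(position - goal)


best_action, predicted_state = select_action(
    state=(2.0,),
    actions=[-1.0, 0.0, 1.0],
    action_model=toy_action_model,
    reward=toy_reward,
)
print(best_action, predicted_state)
```

In this toy setting the object sits at position 2.0 with an assumed goal at 5.0, so moving by +1.0 yields the predicted state closest to the goal. A real system would replace `toy_action_model` with the learned network and would likely use a richer target than the one-step reward.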

