The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Patent No.:

US 10935982 B1

Date of Patent:

Mar. 02, 2021

Filed:

Oct. 04, 2017

Method of selection of an action for an object using a neural network

Applicants:

Hengshuai Yao, Markham, CA;

Hao Chen, Ottawa, CA;

Seyed Masoud Nosrati, Markham, CA;

Peyman Yadmellat, North York, CA;

Yunfei Zhang, Aurora, CA;

Inventors:

Hengshuai Yao, Markham, CA;

Hao Chen, Ottawa, CA;

Seyed Masoud Nosrati, Markham, CA;

Peyman Yadmellat, North York, CA;

Yunfei Zhang, Aurora, CA;

Assignee:

Huawei Technologies Co., Ltd., Shenzhen, CN;

Attorney:

Primary Examiner:

Redhwan K Mawari

Int. Cl.

CPC ...

G05D 1/02 (2020.01); G06N 3/04 (2006.01); G06N 3/00 (2006.01); B60W 40/12 (2012.01); G06N 3/08 (2006.01); G06N 3/02 (2006.01);

U.S. Cl.

CPC ...

G05D 1/0221 (2013.01); B60W 40/12 (2013.01); G05D 1/0246 (2013.01); G06N 3/006 (2013.01); G06N 3/02 (2013.01); G06N 3/04 (2013.01); G06N 3/0454 (2013.01); G06N 3/08 (2013.01); G06N 3/084 (2013.01); G05D 2201/0213 (2013.01); G06N 3/0481 (2013.01);

Abstract

A method, device and system of prediction of a state of an object in the environment using a pre-trained action model defined by an action model neural network. A control system for an object comprises a plurality of sensors for sensing a current state and an environment in which the object is located, and a first neural network. Predicted subsequent states of the object in the environment are obtained using the action model and a current state of the object in the environment The action model maps a plurality of state-action pairs (s, a), each state-action pair encoding a state (s) of the object in the environment and an action (a) performed by the object to a predicted subsequent state (s') of the object in the environment. An action that maximizes a value of a target, based at least on a reward for each of the predicted subsequent states, is determined. The determined action is caused to be performed.

Find Patent Forward Citations