The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jul. 21, 2020

Filed:

Dec. 12, 2019
Applicant:

Alibaba Group Holding Limited, George Town, KY;

Inventors:

Hui Li, Hangzhou, CN;

Kailiang Hu, Hangzhou, CN;

Le Song, Hangzhou, CN;

Assignee:

Alibaba Group Holding Limited, George Town, Grand Cayman, KY;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 9/48 (2006.01); G06F 16/33 (2019.01); G06F 16/31 (2019.01); G06F 9/30 (2018.01);
U.S. Cl.
CPC ...
G06F 9/4881 (2013.01); G06F 9/30065 (2013.01); G06F 16/322 (2019.01); G06F 16/334 (2019.01);
Abstract

Disclosed herein are methods, systems, and apparatus of an execution device for generating an action selection policy for completing a task in an environment that includes the execution device and one or more other devices. One method includes: in a current iteration, identifying an iterative action selection policy of an action in a state of the execution device in a previous iteration; computing a regret value in the previous iteration based on the iterative action selection policy in the previous iteration; computing an incremental action selection policy in the current iteration based on the regret value in the previous iteration but not any regret value in any iteration prior to the previous iteration; computing an iterative action selection policy in the current iteration based on the iterative action selection policy in the previous iteration and the incremental action selection policy in the current iteration.


Find Patent Forward Citations

Loading…