The patent badge is an abbreviated version of the USPTO patent document. It covers the following fields: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Patent No.: US 12149078 B1

Date of Patent: Nov. 19, 2024

Filed: Oct. 11, 2020

Title: Method for intelligently adjusting power flow based on Q-learning algorithm

Applicant: State Grid Zhejiang Electric Power Co., Ltd. Taizhou Power Supply Company, Zhejiang, CN;

Inventors:
Jian Yang, Zhejiang, CN;
Dongbo Zhang, Zhejiang, CN;
Xinjian Chen, Zhejiang, CN;
Yilun Zhu, Zhejiang, CN;
Jie Yu, Zhejiang, CN;
Daojian Hong, Zhejiang, CN;
Zhouhong Wang, Zhejiang, CN;
Chenghuai Hong, Zhejiang, CN;
Zihuai Zheng, Zhejiang, CN;
Huiying Gao, Zhejiang, CN;
Minyan Xia, Zhejiang, CN;
Bingren Wang, Zhejiang, CN;
Guode Ying, Zhejiang, CN;
Yizhi Zhu, Zhejiang, CN;

Assignees:
STATE GRID ZHEJIANG ELECTRIC POWER CO., LTD., Taizhou, CN;
TAIZHOU POWER SUPPLY COMPANY, Taizhou, CN;

Attorney:

Primary Examiner: Emilio J Saavedra

Int. Cl.: G06N 20/00 (2019.01); G06Q 50/06 (2012.01); H02J 3/06 (2006.01);

U.S. Cl. CPC: H02J 3/06 (2013.01); G06N 20/00 (2019.01); G06Q 50/06 (2013.01);

Abstract

A method for intelligently adjusting a power flow based on a Q-learning algorithm includes: step 1, converting a variable, an action, and a goal in a power grid to a state, an action, and a reward in the algorithm, respectively; step 2, selecting an action from an action space, giving an immediate reward based on a result of power flow calculation, and correcting a next state; step 3, forwardly observing a next exploration action based on a strategy in the Q-learning algorithm; step 4, updating a Q value in a corresponding position in a Q-value table based on the obtained reward; step 5, if a final state is not reached, going back to step 2; otherwise, increasing the number of iterations by 1; step 6, if the number of iterations does not reach a predetermined value K (that is, Episode < K), going back to step 2; otherwise (that is, Episode = K), outputting the Q-value table; and step 7, outputting an optimal unit combination.
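The loop described in the abstract follows the general shape of tabular Q-learning, which can be illustrated with a small sketch. The example below is not the patented method: the unit capacities, the demand, the reward values, the epsilon-greedy strategy, and the helper names (`mismatch`, `step`, `train`, `best_combination`) are all assumptions made for illustration, and a simple generation/demand gap stands in for a real power flow calculation.

```python
import random

# Hypothetical toy data (not from the patent): unit capacities and demand in MW.
CAPACITIES = [30, 50, 20]
DEMAND = 70


def mismatch(state):
    """Stand-in for a power flow calculation: absolute generation/demand gap."""
    generation = sum(cap for cap, on in zip(CAPACITIES, state) if on)
    return abs(generation - DEMAND)


def step(state, action):
    """Toggle one unit and score the result. Returns (next_state, reward, done)."""
    next_state = tuple(1 - on if i == action else on for i, on in enumerate(state))
    gap = mismatch(next_state)
    if gap == 0:
        return next_state, 10.0, True    # final state: demand met exactly
    return next_state, -gap / DEMAND, False  # immediate penalty for the gap


def greedy_action(row):
    """Index of the largest Q value in one row of the table."""
    return max(range(len(row)), key=row.__getitem__)


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, max_steps=20, seed=0):
    """Run K = `episodes` episodes and return the Q-value table."""
    rng = random.Random(seed)
    n_actions = len(CAPACITIES)
    q = {}  # Q-value table: state -> list of action values

    for _episode in range(episodes):
        state = (0,) * n_actions  # every episode starts with all units off
        for _ in range(max_steps):
            row = q.setdefault(state, [0.0] * n_actions)
            # Epsilon-greedy strategy (an assumption; the abstract only says "a strategy").
            if rng.random() < epsilon:
                action = rng.randrange(n_actions)
            else:
                action = greedy_action(row)
            next_state, reward, done = step(state, action)
            next_row = q.setdefault(next_state, [0.0] * n_actions)
            # Standard Q-learning update toward reward plus discounted best next value.
            target = reward if done else reward + gamma * max(next_row)
            row[action] += alpha * (target - row[action])
            state = next_state
            if done:
                break
    return q


def best_combination(q, max_steps=20):
    """Follow the greedy policy from the all-off state to read out a unit combination."""
    state = (0,) * len(CAPACITIES)
    for _ in range(max_steps):
        if mismatch(state) == 0:
            break
        row = q.get(state, [0.0] * len(CAPACITIES))
        state, _reward, _done = step(state, greedy_action(row))
    return state


if __name__ == "__main__":
    table = train()
    print("unit on/off combination:", best_combination(table))
```

In this toy setup the state is the on/off status of each unit and an action toggles one unit, so the "optimal unit combination" is simply the state the greedy policy ends in once training stops. A real application would replace `mismatch` with an actual power flow solver and a reward that reflects grid constraints.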
