The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Dec. 15, 2020

Filed:

Aug. 14, 2019
Applicant:

Airspan Networks Inc., Boca Raton, FL (US);

Inventors:

Andrew Logothetis, Buckinghamshire, GB;

Stuart Parrott, Oxfordshire, GB;

Michael David Livingstone, Berkshire, GB;

Qasim Khan, Slough, GB;

Assignee:

AIRSPAN NETWORKS INC., Boca Raton, FL (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
H04W 4/00 (2018.01); H04W 88/14 (2009.01); G06N 20/00 (2019.01); H04W 48/00 (2009.01); H04W 48/20 (2009.01);
U.S. Cl.
CPC ...
H04W 88/14 (2013.01); G06N 20/00 (2019.01); H04W 48/17 (2013.01); H04W 48/20 (2013.01);
Abstract

An apparatus and method are provided for configuring a communication link. Wherein the apparatus has a plurality of antenna elements to support RF communication using a plurality of frequency channels, a plurality of RF processing circuits, and configuration circuitry to apply a selected configuration from a plurality of different configurations, where each configuration identifies which RF processing circuit each antenna element coupled to, and which channel allocated to each RF processing circuit. The configuration circuitry arranged to a reinforcement learning process in order to dynamically alter which of the plurality of different configurations to apply a currently selected configuration. The reinforcement learning process maintaining a future rewards record having a plurality of entries, where each entry maintains, for an associated combination of link state and configuration, an estimated future rewards indication determined using a discounted rewards mechanism. A selection policy is to select a configuration for a current link state, and a new reward is observed is dependent on how the selected configuration alters a chosen performance metric for the communication link. The estimated future rewards indication in the associated entry is then updated in dependence on the new reward.


Find Patent Forward Citations

Loading…