The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jun. 11, 2024

Filed:

Aug. 09, 2019
Applicant:

Board of Trustees of Michigan State University, East Lansing, MI (US);

Inventors:

Mi Zhang, Okemos, MI (US);

Biyi Fang, Lansing, MI (US);

Xiao Zeng, Lansing, MI (US);

Assignee:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06N 3/063 (2023.01); G06F 18/20 (2023.01); G06F 18/214 (2023.01); G06N 3/045 (2023.01); G06N 3/082 (2023.01); G06N 3/10 (2006.01); G06V 10/94 (2022.01); G06V 20/40 (2022.01);
U.S. Cl.
CPC ...
G06N 3/063 (2013.01); G06F 18/2148 (2023.01); G06F 18/285 (2023.01); G06N 3/045 (2023.01); G06N 3/082 (2013.01); G06N 3/10 (2013.01); G06V 10/95 (2022.01); G06V 20/40 (2022.01);
Abstract

Systems and methods are disclosed which allow mobile devices, and other resource constrained applications, to more efficiently and effectively utilize deep learning neural networks using only (or primarily) local resources. These systems and methods take the dynamics of runtime resources into account to enable resource-aware, multi-tenant on-device deep learning for artificial intelligence functions for use in tasks like mobile vision systems. The multi-capacity framework enables deep learning models to offer flexible resource-accuracy trade-offs and other similar balancing of performance and resources consumed. At runtime, various systems disclosed herein may dynamically select the optimal resource-accuracy trade-off for each deep learning model to fit the model's resource demand to the system's available runtime resources and the needs of the task being performed by the model. In doing so, systems and methods disclosed herein can efficiently utilize the limited resources in mobile systems to maximize performance of multiple concurrently running neural network-based applications.


Find Patent Forward Citations

Loading…