The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Nov. 09, 2021

Filed:

Jan. 07, 2019
Applicant:

Amazon Technologies, Inc., Seattle, WA (US);

Inventors:

William Evan Welbourne, Seattle, WA (US);

Ross David Roessler, Seattle, WA (US);

Cheng-Hao Kuo, Seattle, WA (US);

Jim Oommen Thomas, Seattle, WA (US);

Paul Aksenti Savastinuk, Shoreline, WA (US);

Yinfei Yang, Bellevue, WA (US);

Assignee:

Amazon Technologies, Inc., Seattle, WA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
H04N 5/232 (2006.01); H04N 5/222 (2006.01); G06K 9/00 (2006.01); G10L 17/10 (2013.01); G10L 17/04 (2013.01); H04N 7/14 (2006.01);
U.S. Cl.
CPC ...
H04N 5/23219 (2013.01); G06K 9/00221 (2013.01); G06K 9/00288 (2013.01); G06K 9/00892 (2013.01); G10L 17/10 (2013.01); H04N 5/2228 (2013.01); G10L 17/04 (2013.01); H04N 7/147 (2013.01);
Abstract

Devices, systems and methods are disclosed for improving facial recognition and/or speaker recognition models by using results obtained from one model to assist in generating results from the other model. For example, a device may perform facial recognition for image data to identify users and may use the results of the facial recognition to assist in speaker recognition for corresponding audio data. Alternatively or additionally, the device may perform speaker recognition for audio data to identify users and may use the results of the speaker recognition to assist in facial recognition for corresponding image data. As a result, the device may identify users in video data that are not included in the facial recognition model and may identify users in audio data that are not included in the speaker recognition model. The facial recognition and/or speaker recognition models may be updated during run-time and/or offline using post-processed data.


Find Patent Forward Citations

Loading…