The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Oct. 22, 2024

Filed:

Oct. 17, 2022
Applicant:

Adobe Inc., San Jose, CA (US);

Inventors:

Fabian David Caba Heilbron, Campbell, CA (US);

Xue Bai, Bellevue, WA (US);

Aseem Omprakash Agarwala, Seattle, WA (US);

Haoran Cai, Mercer Island, WA (US);

Lubomira Assenova Dontcheva, Seattle, WA (US);

Assignee:

Adobe Inc., San Jose, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G11B 27/031 (2006.01); G06V 20/40 (2022.01);
U.S. Cl.
CPC ...
G11B 27/031 (2013.01); G06V 20/41 (2022.01);
Abstract

Embodiments of the present invention provide systems, methods, and computer storage media for face-aware speaker diarization. In an example embodiment, an audio-only speaker diarization technique is applied to generate an audio-only speaker diarization of a video, an audio-visual speaker diarization technique is applied to generate a face-aware speaker diarization of the video, and the audio-only speaker diarization is refined using the face-aware speaker diarization to generate a hybrid speaker diarization that links detected faces to detected voices. In some embodiments, to accommodate videos with small faces that appear pixelated, a cropped image of any given face is extracted from each frame of the video, and the size of the cropped image is used to select a corresponding active speaker detection model to predict an active speaker score for the face in the cropped image.


Find Patent Forward Citations

Loading…