The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Oct. 28, 2025

Filed:

Apr. 19, 2023
Applicant:

Synaptics Incorporated, San Jose, CA (US);

Inventor:

Saeed Mosayyebpour Kaskari, Irvine, CA (US);

Assignee:

Synaptics Incorporated, San Jose, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G10L 21/0272 (2013.01); G06V 40/16 (2022.01); G10L 21/0216 (2013.01); H04R 3/00 (2006.01); H04R 5/027 (2006.01); H04S 3/00 (2006.01);
U.S. Cl.
CPC ...
G10L 21/0272 (2013.01); G06V 40/161 (2022.01); G10L 21/0216 (2013.01); H04R 3/005 (2013.01); H04R 5/027 (2013.01); H04S 3/008 (2013.01); G10L 2021/02166 (2013.01); H04S 2400/01 (2013.01); H04S 2400/15 (2013.01);
Abstract

This disclosure provides methods, devices, and systems for speech enhancement. The present implementations more specifically relate to utilizing multiple modalities to suppress audio originating from a distractor audio source without distorting audio originating from a target audio source. In some aspects, a speech enhancement system may receive a multi-channel audio signal via a microphone array and may further receive an image associated with a respective frame of the audio signal. The speech enhancement system detects one or more target faces in the image and determines whether the audio frame originates from a target audio source. For example, the speech enhancement system may compare a respective direction of each target face with a direction-of-arrival (DOA) of the audio frame. The speech enhancement system may selectively steer a beam associated with a multi-channel beamformer toward the DOA of the audio frame based on whether the audio frame originates from a target face.


Find Patent Forward Citations

Loading…