The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Mar. 18, 2025

Filed:

Oct. 10, 2019
Applicant:

Google Llc, Mountain View, CA (US);

Inventors:

Quan Wang, Hoboken, NJ (US);

Ignacio Lopez Moreno, New York, NY (US);

Li Wan, New York, NY (US);

Assignee:

GOOGLE LLC, Mountain View, CA (US);

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G10L 21/028 (2013.01); G10L 17/02 (2013.01); G10L 17/04 (2013.01); G10L 17/18 (2013.01); G10L 21/0232 (2013.01);
U.S. Cl.
CPC ...
G10L 21/028 (2013.01); G10L 17/02 (2013.01); G10L 17/04 (2013.01); G10L 17/18 (2013.01); G10L 21/0232 (2013.01);
Abstract

Processing of acoustic features of audio data to generate one or more revised versions of the acoustic features, where each of the revised versions of the acoustic features isolates one or more utterances of a single respective human speaker. Various implementations generate the acoustic features by processing audio data using portion(s) of an automatic speech recognition system. Various implementations generate the revised acoustic features by processing the acoustic features using a mask generated by processing the acoustic features and a speaker embedding for the single human speaker using a trained voice filter model. Output generated over the trained voice filter model is processed using the automatic speech recognition system to generate a predicted text representation of the utterance(s) of the single human speaker without reconstructing the audio data.


Find Patent Forward Citations

Loading…