The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e., PDF).

Date of Patent:
Jun. 08, 2021

Filed:
Jan. 08, 2019

Applicant:

Google LLC, Mountain View, CA (US);

Inventors:

Chong Wang, Bellevue, WA (US);

Aonan Zhang, Mountain View, CA (US);

Quan Wang, Mountain View, CA (US);

Zhenyao Zhu, Mountain View, CA (US);

Assignee:

Google LLC, Mountain View, CA (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G10L 17/04 (2013.01); G10L 15/04 (2013.01); G10L 15/07 (2013.01); G10L 17/02 (2013.01); G10L 17/18 (2013.01); G10L 15/26 (2006.01); G10L 17/00 (2013.01);
U.S. Cl.
CPC ...
G10L 17/04 (2013.01); G10L 15/04 (2013.01); G10L 15/075 (2013.01); G10L 15/26 (2013.01); G10L 17/00 (2013.01); G10L 17/02 (2013.01); G10L 17/18 (2013.01);
Abstract

A method includes receiving an utterance of speech and segmenting the utterance of speech into a plurality of segments. For each segment of the utterance of speech, the method also includes extracting a speaker-discriminative embedding from the segment and predicting a probability distribution over possible speakers for the segment using a probabilistic generative model configured to receive the extracted speaker-discriminative embedding as a feature input. The probabilistic generative model is trained on a corpus of training speech utterances, each segmented into a plurality of training segments. Each training segment includes a corresponding speaker-discriminative embedding and a corresponding speaker label. The method also includes assigning a speaker label to each segment of the utterance of speech based on the probability distribution over possible speakers for the corresponding segment.
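
The steps in the abstract (segment, embed, predict a distribution over speakers, assign a label) can be illustrated with a minimal Python sketch. This is not the patented model: the abstract does not specify the generative model or the embedding, so the sketch swaps in a simple per-speaker diagonal Gaussian classifier and a toy two-number embedding. All function names (segment, embed, fit, predict_distribution, diarize) and the var_floor parameter are hypothetical and exist only for illustration.

```python
import math
from collections import defaultdict

def segment(samples, size):
    """Split a sequence into consecutive fixed-size segments (the last may be shorter)."""
    return [samples[i:i + size] for i in range(0, len(samples), size)]

def embed(seg):
    """Toy stand-in for a speaker-discriminative embedding (assumption):
    the mean and the mean absolute value of the segment."""
    n = len(seg)
    return (sum(seg) / n, sum(abs(x) for x in seg) / n)

def fit(train_embeddings, train_labels, var_floor=1e-3):
    """Fit a prior and one diagonal Gaussian per speaker from labeled training embeddings."""
    by_speaker = defaultdict(list)
    for emb, label in zip(train_embeddings, train_labels):
        by_speaker[label].append(emb)
    total = len(train_labels)
    model = {}
    for label, embs in by_speaker.items():
        dim = len(embs[0])
        mean = [sum(e[d] for e in embs) / len(embs) for d in range(dim)]
        # Floor the variance so a single training example does not yield zero variance.
        var = [max(sum((e[d] - mean[d]) ** 2 for e in embs) / len(embs), var_floor)
               for d in range(dim)]
        model[label] = (len(embs) / total, mean, var)
    return model

def predict_distribution(model, emb):
    """Return the posterior probability of each speaker given one embedding (Bayes' rule)."""
    log_scores = {}
    for label, (prior, mean, var) in model.items():
        log_lik = sum(-0.5 * (math.log(2 * math.pi * v) + (x - m) ** 2 / v)
                      for x, m, v in zip(emb, mean, var))
        log_scores[label] = math.log(prior) + log_lik
    top = max(log_scores.values())  # subtract the max for numerical stability
    unnorm = {label: math.exp(s - top) for label, s in log_scores.items()}
    z = sum(unnorm.values())
    return {label: u / z for label, u in unnorm.items()}

def diarize(model, samples, size):
    """Assign the most probable speaker label to each segment of an utterance."""
    labels = []
    for seg in segment(samples, size):
        dist = predict_distribution(model, embed(seg))
        labels.append(max(dist, key=dist.get))
    return labels
```

In this sketch each segment is labeled independently of its neighbors. A model that also accounts for the order of speaker turns across segments would need additional sequence structure that is not shown here.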

