The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Sep. 16, 2025

Filed:

Oct. 12, 2022
Applicant:

Samsung Electronics Co., Ltd., Suwon-si, KR;

Inventors:

Myungjong Kim, Milpitas, CA (US);

Taeyeon Ki, Milpitas, CA (US);

Vijendra Raj Apsingekar, San Jose, CA (US);

Sungjae Park, Seoul, KR;

Seungbeom Ryu, Suwon, KR;

Hyuk Oh, Seoul, KR;

Assignee:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G10L 21/00 (2013.01); G10L 17/02 (2013.01); G10L 17/06 (2013.01); G10L 21/028 (2013.01);
U.S. Cl.
CPC ...
G10L 21/028 (2013.01); G10L 17/02 (2013.01); G10L 17/06 (2013.01);
Abstract

A method includes obtaining at least a portion of an audio stream containing speech activity. At least the portion of the audio stream includes multiple segments. The method also includes, for each of the segments, generating an embedding vector that represents the segment. The method further includes, within each of multiple local windows, clustering the embedding vectors into one or more clusters to perform speaker identification. Different clusters correspond to different speakers. The method also includes presenting at least one first sequence of speaker identities based on the speaker identification for the local windows. The method further includes, within each of multiple global windows, clustering the embedding vectors into one or more clusters to perform speaker identification. Each global window includes two or more of the local windows. The method further includes presenting at least one second sequence of speaker identities based on the speaker identification for the global windows.


Find Patent Forward Citations

Loading…