The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:

Sep. 14, 2021

Filed:

Nov. 21, 2017

Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Kenneth W. Church, Dobbs Ferry, NY (US);

Dimitrios B. Dimitriadis, White Plains, NY (US);

Petr Fousek, Litomerice, CZ;

Miroslav Novak, Mohegan Lake, NY (US);

George A. Saon, Stamford, CT (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G10L 15/26 (2006.01); G10L 25/78 (2013.01); G10L 15/08 (2006.01); G10L 25/51 (2013.01); G10L 17/04 (2013.01); G10L 17/00 (2013.01); G10L 15/06 (2013.01);
U.S. Cl.
CPC ...
G10L 15/26 (2013.01); G10L 15/08 (2013.01); G10L 17/00 (2013.01); G10L 17/04 (2013.01); G10L 25/51 (2013.01); G10L 25/78 (2013.01); G10L 2015/0631 (2013.01); G10L 2015/088 (2013.01);
Abstract

An approach is provided that receives an audio stream and utilizes a voice activation detection (VAD) process to create a digital audio stream of voices from at least two different speakers. An automatic speech recognition (ASR) process is applied to the digital stream with the ASR process resulting in the spoken words to which a speaker turn detection (STD) process is applied to identify a number of speaker segments with each speaker segment ending at a word boundary. A speaker clustering algorithm is then applied to the speaker segments to associate one of the speakers with each of the speaker segments.
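
The last two stages described in the abstract, speaker turn detection at word boundaries followed by clustering of the resulting segments, can be illustrated with a minimal sketch. This is not the patented method. It assumes VAD and ASR have already run upstream and produced time-stamped words, and it uses a single hypothetical `pitch` number per word as a stand-in for a real speaker feature vector. The names `Word`, `Segment`, `detect_speaker_turns`, `cluster_segments`, and the thresholds are all invented for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Word:
    text: str
    start: float  # seconds
    end: float    # seconds
    pitch: float  # hypothetical toy feature standing in for a speaker embedding


@dataclass
class Segment:
    words: List[Word]

    @property
    def mean_pitch(self) -> float:
        return sum(w.pitch for w in self.words) / len(self.words)


def detect_speaker_turns(words: List[Word], max_jump: float = 40.0) -> List[Segment]:
    """Split the word sequence wherever the toy feature jumps sharply.

    Cuts are only placed between two words, so every segment
    ends at a word boundary.
    """
    segments: List[Segment] = []
    for word in words:
        if segments and abs(word.pitch - segments[-1].words[-1].pitch) <= max_jump:
            segments[-1].words.append(word)
        else:
            segments.append(Segment([word]))
    return segments


def cluster_segments(segments: List[Segment], max_dist: float = 40.0) -> List[int]:
    """Greedy clustering: assign each segment to the nearest speaker centroid,
    or open a new speaker when no centroid is within max_dist."""
    centroids: List[float] = []
    counts: List[int] = []
    labels: List[int] = []
    for seg in segments:
        feature = seg.mean_pitch
        best = min(
            range(len(centroids)),
            key=lambda i: abs(centroids[i] - feature),
            default=None,
        )
        if best is not None and abs(centroids[best] - feature) <= max_dist:
            counts[best] += 1
            # Running mean of the segment features assigned to this speaker.
            centroids[best] += (feature - centroids[best]) / counts[best]
            labels.append(best)
        else:
            centroids.append(feature)
            counts.append(1)
            labels.append(len(centroids) - 1)
    return labels


# Made-up ASR output: two voices alternating.
words = [
    Word("hello", 0.0, 0.4, 120.0),
    Word("there", 0.4, 0.8, 125.0),
    Word("hi", 1.0, 1.2, 210.0),
    Word("back", 1.2, 1.6, 205.0),
    Word("thanks", 1.8, 2.3, 118.0),
]
segments = detect_speaker_turns(words)
labels = cluster_segments(segments)
for seg, label in zip(segments, labels):
    print(f"speaker {label}: {' '.join(w.text for w in seg.words)}")
```

On this toy input the words fall into three segments, and the first and third segments are assigned to the same speaker. A real system would replace the scalar feature with acoustic embeddings and the greedy pass with a proper clustering algorithm.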

