The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jun. 06, 2017

Filed:

Mar. 23, 2015
Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Ye Q. Chen, Shanghai, CN;

Wen J. Nie, Ningbo, CN;

Ting Wu, Shanghai, CN;

Zhao Yang, Shanghai, CN;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G10L 15/26 (2006.01); G10L 17/02 (2013.01); H04N 7/15 (2006.01); G10L 25/57 (2013.01); H04L 12/18 (2006.01); G10L 25/87 (2013.01); H04N 7/14 (2006.01); G10L 21/10 (2013.01); G10L 17/00 (2013.01);
U.S. Cl.
CPC ...
G10L 17/02 (2013.01); G10L 15/26 (2013.01); G10L 25/57 (2013.01); G10L 25/87 (2013.01); H04L 12/1831 (2013.01); H04N 7/147 (2013.01); H04N 7/15 (2013.01); G10L 17/00 (2013.01); G10L 21/10 (2013.01); H04L 12/1827 (2013.01);
Abstract

Embodiments of the present invention disclose a method, system, and computer program product for speech summarization. A computer receives audio and video components from a video conference. The computer determines which participant is speaking based on comparing images of the participants with template images of speaking and non-speaking faces. The computer determines the voiceprint of the speaking participant by applying a Hidden Markov Model to a brief recording of the voice waveform of the participant and associates the determined voiceprint with the face of the speaking participant. The computer recognizes and transcribes the content of statements made by the speaker, determines the key points, and displays them over the face of the participant in the video conference.


Find Patent Forward Citations

Loading…