The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jan. 17, 2017

Filed:

Jun. 19, 2015
Applicant:

Amazon Technologies, Inc., Seattle, WA (US);

Inventors:

Mashhour Solh, San Jose, CA (US);

Krishna Kamath Koteshwara, San Jose, CA (US);

Assignee:

AMAZON TECHNOLOGIES, INC., Seattle, WA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G10L 15/04 (2013.01); G10L 15/06 (2013.01); G10L 15/24 (2013.01);
U.S. Cl.
CPC ...
G10L 15/063 (2013.01); G10L 15/24 (2013.01);
Abstract

A speech recognition computer system uses video input as well as audio input of known speech when the speech recognition computer system is being trained to recognize unknown speech. The video of the speaker can be captured using multiple cameras, from multiple angles. The audio can be captured using multiple microphones. The video and audio can be sampled so that timing of events in the video and audio can be determined from the content independent of an audio or video capture device's clock. Video features, such as a speaker's moving body parts, can be extracted from the video and random sampled, to be used in a speech modeling process. Audio is modeled at the phoneme level, which provides word mapping with minor additional effort. The trained speech recognition computer system can then be used to recognize speech text from video/audio of unknown speech.


Find Patent Forward Citations

Loading…