The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Mar. 02, 2021

Filed:

Mar. 11, 2019
Applicant:

Adobe Inc., San Jose, CA (US);

Inventor:

Yaman Kumar, Inderpuri, IN;

Assignee:

Adobe Inc., San Jose, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G10L 15/22 (2006.01); G10L 15/25 (2013.01); G06N 3/08 (2006.01); G06N 3/04 (2006.01); G06K 9/00 (2006.01);
U.S. Cl.
CPC ...
G10L 15/25 (2013.01); G06K 9/00335 (2013.01); G06N 3/0454 (2013.01); G06N 3/088 (2013.01);
Abstract

A pose-invariant visual speech recognition system obtains a single view input of a speaker, such as a single video stream captured by a single camera. The single view input provides a particular pose of the speaker, which refers to a view angle, relative to the lens or image capture component of the camera that captured the video of the speaker, at which the speaker's face is captured. The pose of the speaker is used to select a visual speech recognition model to use to generate a text label that is the words spoken by the speaker. One or more additional view angles of the speaker are also generated from the single view input of the speaker. These one or more additional view angles, along with the single view input of the speaker, are used by the selected visual speech recognition model to generate the text label for the speaker.


Find Patent Forward Citations

Loading…