The patent badge is an abbreviated version of the USPTO patent document. It lists the following fields: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:

Sep. 05, 2023

Filed:

Jan. 14, 2022

Applicant:

Meta Platforms Technologies, LLC, Menlo Park, CA (US);

Inventors:

Vincent Charles Cheung, San Carlos, CA (US);

Chengxuan Bai, San Mateo, CA (US);

Yating Sheng, San Francisco, CA (US);

Assignee:

META PLATFORMS TECHNOLOGIES, LLC, Menlo Park, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G10L 17/00 (2013.01); G06F 3/01 (2006.01); G06T 19/00 (2011.01); G10L 25/63 (2013.01); H04R 1/40 (2006.01); H04R 3/00 (2006.01); G06V 40/16 (2022.01);
U.S. Cl.
CPC ...
G10L 17/00 (2013.01); G06F 3/011 (2013.01); G06T 19/006 (2013.01); G06V 40/161 (2022.01); G10L 25/63 (2013.01); H04R 1/406 (2013.01); H04R 3/005 (2013.01);
Abstract

This disclosure describes transcribing speech using audio, image, and other data. A system is described that includes an audio capture system configured to capture audio data associated with a plurality of speakers, an image capture system configured to capture images of one or more of the plurality of speakers, and a speech processing engine. The speech processing engine may be configured to recognize a plurality of speech segments in the audio data, identify, for each speech segment of the plurality of speech segments and based on the images, a speaker associated with the speech segment, transcribe each of the plurality of speech segments to produce a transcription of the plurality of speech segments including, for each speech segment in the plurality of speech segments, an indication of the speaker associated with the speech segment, and analyze the transcription to produce additional data derived from the transcription.
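
The flow described in the abstract (recognize speech segments, attribute each to a speaker using images, produce a speaker-labeled transcription, then derive additional data from it) can be illustrated with a minimal sketch. This is not the patented implementation: the names `SpeechSegment`, `FaceObservation`, `identify_speaker`, `transcribe`, and `analyze` are hypothetical, the segment text is assumed to come from some speech recognizer, the per-image `speaking` flag is assumed to come from some image analysis step, and the majority-vote attribution and the word-count analysis are stand-ins chosen only for illustration.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class SpeechSegment:
    start: float  # segment start time, in seconds
    end: float    # segment end time, in seconds
    text: str     # assumed output of a speech recognizer (hypothetical)


@dataclass
class FaceObservation:
    time: float     # capture time of the image, in seconds
    speaker: str    # identity assumed to come from image analysis
    speaking: bool  # hypothetical flag, e.g. the face appears to be talking


def identify_speaker(segment, observations):
    """Return the person most often observed speaking during the segment."""
    votes = Counter(
        o.speaker
        for o in observations
        if o.speaking and segment.start <= o.time <= segment.end
    )
    return votes.most_common(1)[0][0] if votes else "unknown"


def transcribe(segments, observations):
    """Build a transcription as (speaker, text) pairs, one per segment."""
    return [(identify_speaker(s, observations), s.text) for s in segments]


def analyze(transcript):
    """Derive additional data from the transcription: words per speaker."""
    counts = Counter()
    for speaker, text in transcript:
        counts[speaker] += len(text.split())
    return dict(counts)


# Illustrative, made-up inputs.
segments = [
    SpeechSegment(0.0, 2.0, "hello everyone"),
    SpeechSegment(2.5, 4.0, "hi there Alice"),
]
observations = [
    FaceObservation(0.5, "Alice", True),
    FaceObservation(1.0, "Bob", False),
    FaceObservation(1.5, "Alice", True),
    FaceObservation(3.0, "Bob", True),
]

transcript = transcribe(segments, observations)
stats = analyze(transcript)
```

With the made-up inputs above, the first segment is attributed to Alice and the second to Bob, and `stats` holds a per-speaker word count. A real system would replace the stubbed inputs with actual audio and image processing.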

