For the Inventor, By the Inventor

The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Patent No.:

US 11551695 B1

Date of Patent:

Jan. 10, 2023

Filed:

May. 13, 2020

Model training system for custom speech-to-text models

Applicant:

Amazon Technologies, Inc., Seattle, WA (US);

Inventors:

Vivek Govindan, Redmond, WA (US);

Varun Sembium Varadarajan, Bothell, WA (US);

Christian Egon Berkhoff Dossow, Lake Forest Park, WA (US);

Himalay Mohanlal Joriwal, Seattle, WA (US);

Sai Madhuri Bhavirisetty, Bellevue, WA (US);

Abhinav Kumar, Bellevue, WA (US);

Orestis Lykouropoulos, Seattle, WA (US);

Akshay Nalwaya, Seattle, WA (US);

Rahul Gupta, Seattle, WA (US);

Sravan Babu Bodapati, Redmond, WA (US);

Liangwei Guo, Seattle, WA (US);

Julian E. S. Salazar, San Francisco, CA (US);

Yibin Wang, Seattle, WA (US);

K P N V D S Siva Rama, Seattle, WA (US);

Calvin Xuan Li, Seattle, WA (US);

Mohit Narendra Gupta, Seattle, WA (US);

Asem Rustum, Sammamish, WA (US);

Katrin Kirchhoff, Seattle, WA (US);

Pu Zhao, Lynnwood, WA (US);

Assignee:

Amazon Technologies, Inc., Seattle, WA (US);

Attorneys:

Robert C. Kowert

Kowert, Hood, Muynon, Rankin & Goetzel, P.C.

Primary Examiner:

Int. Cl.

CPC ...

G10L 15/26 (2006.01); G10L 15/07 (2013.01); G10L 15/06 (2013.01);

U.S. Cl.

CPC ...

G10L 15/26 (2013.01); G10L 15/063 (2013.01); G10L 15/07 (2013.01); G10L 2015/0638 (2013.01);

Abstract

A transcription service may receive a request from a developer to build a custom speech-to-text model for a specific domain of speech. The custom speech-to-text model for the specific domain may replace a general speech-to-text model or add to a set of one or more speech-to-text models available for transcribing speech. The transcription service may receive a training data and instructions representing tasks. The transcription service may determine respective schedules for executing the instructions based at least in part on dependencies between the tasks. The transcription service may execute the instructions according to the respective schedules to train a speech-to-text model for a specific domain using the training data set. The transcription service may deploy the trained speech-to-text model as part of a network-accessible service for an end user to convert audio in the specific domain into texts.

Find Patent Forward Citations

Loading…