The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Jan. 14, 2025
Filed:
Sep. 30, 2022
Amazon Technologies, Inc., Seattle, WA (US);
Monica Lakshmi Sunkara, San Jose, CA (US);
Srikanth Ronanki, San Jose, CA (US);
Sravan Babu Bodapati, Redmond, WA (US);
Jeffrey John Farris, Crystal Lake, IL (US);
Katrin Kirchhoff, Seattle, WA (US);
Vivek Govindan, Redmond, WA (US);
Yide Zou, Aachen, DE;
Mohit Narendra Gupta, Seattle, WA (US);
Silviu Mihai Burz, Sinking Spring, PA (US);
Amazon Technologies, Inc., Seattle, WA (US);
Abstract
Techniques for personalized batch and streaming speech-to-text transcription of audio reduce the error rate of automatic speech recognition (ASR) systems in transcribing rare and out-of-vocabulary words. The techniques achieve personalization of connectionist temporal classification (CT) models by using adaptive boosting to perform biasing at the level of sub-words. In addition to boosting, the techniques encompass a phone alignment network to bias sub-word predictions towards rare long-tail words and out-of-vocabulary words. A technical benefit of the techniques is that the accuracy of speech-to-text transcription of rare and out-of-vocabulary words in a custom vocabulary by automatic speech recognition (ASR) system can be improved without having to train the ASR system on the custom vocabulary. Instead, the techniques allow the same ASR system trained on a base vocabulary to realize the accuracy improvements for different custom vocabularies spanning different domains.