The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).
Patent No.:
Date of Patent: Aug. 22, 2023
Filed: May 14, 2021
Applicant:
Microsoft Technology Licensing, LLC, Redmond, WA (US);
Inventors:
Yao Qian, Bellevue, WA (US);
Yu Wu, Beijing, CN;
Kenichi Kumatani, Sammamish, WA (US);
Shujie Liu, Beijing, CN;
Furu Wei, Beijing, CN;
Nanshan Zeng, Bellevue, WA (US);
Xuedong David Huang, Yarrow Point, WA (US);
Chengyi Wang, Jinan, CN;
Assignee:
Microsoft Technology Licensing, LLC, Redmond, WA (US);
Abstract
Systems and methods are provided for training a machine learning model to learn speech representations. Labeled speech data, or both labeled and unlabeled data sets, are applied to a feature extractor of a machine learning model to generate latent speech representations. The latent speech representations are applied to a quantizer to generate quantized latent speech representations and to a transformer context network to generate contextual representations. Each contextual representation included in the contextual representations is aligned with a phoneme label to generate phonetically aware contextual representations. Quantized latent representations are aligned with phoneme labels to generate phonetically aware latent speech representations. Systems and methods also include randomly replacing a subset of the contextual representations with quantized latent speech representations during their alignments to phoneme labels and aligning the phonetically aware latent speech representations to the contextual representations using supervised learning.
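
The random replacement step mentioned in the abstract can be illustrated with a small sketch. This is not the patented implementation. It assumes that both representation sequences have one vector per frame, and that each frame is swapped independently with a fixed probability. The abstract does not say how the subset is chosen, so the function name `mix_representations` and the parameter `replace_prob` are hypothetical.

```python
import random
from typing import List, Sequence

Vector = List[float]


def mix_representations(
    contextual: Sequence[Vector],
    quantized: Sequence[Vector],
    replace_prob: float,
    rng: random.Random,
) -> List[Vector]:
    """Swap a random subset of contextual frames for quantized frames.

    Assumption (not stated in the abstract): each frame is replaced
    independently with probability `replace_prob`.
    """
    if len(contextual) != len(quantized):
        raise ValueError("both sequences must have one vector per frame")
    if not 0.0 <= replace_prob <= 1.0:
        raise ValueError("replace_prob must lie in [0, 1]")

    mixed: List[Vector] = []
    for ctx_frame, q_frame in zip(contextual, quantized):
        # rng.random() returns a float in [0.0, 1.0), so a probability of
        # 0.0 never replaces a frame and 1.0 always replaces it.
        use_quantized = rng.random() < replace_prob
        mixed.append(list(q_frame if use_quantized else ctx_frame))
    return mixed


# Toy data: three frames with two-dimensional vectors.
contextual_frames = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
quantized_frames = [[0.0, 0.0], [1.0, 1.0], [2.0, 2.0]]

mixed_frames = mix_representations(
    contextual_frames, quantized_frames, replace_prob=0.5, rng=random.Random(0)
)
```

In a training setup of the kind the abstract describes, the mixed sequence would then be the input that is aligned to phoneme labels. The sketch covers only the mixing, not the alignment loss, feature extractor, quantizer, or transformer context network.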