The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jul. 08, 2025

Filed:

Mar. 11, 2022
Applicant:

Microsoft Technology Licensing, Llc, Redmond, WA (US);

Inventors:

Yashesh Gaur, Redmond, WA (US);

Nicholas Kibre, Redwood City, CA (US);

Issac J. Alphonso, San Jose, CA (US);

Jian Xue, Bellevue, WA (US);

Jinyu Li, Sammamish, WA (US);

Piyush Behre, Santa Clara, CA (US);

Shuangyu Chang, Davis, CA (US);

Assignee:
Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06F 40/00 (2020.01); G06F 40/284 (2020.01); G06F 40/56 (2020.01); G10L 15/08 (2006.01);
U.S. Cl.
CPC ...
G06F 40/56 (2020.01); G06F 40/284 (2020.01); G10L 15/08 (2013.01);
Abstract

Solutions for on-device streaming inverse text normalization (ITN) include: receiving a stream of tokens, each token representing an element of human speech; tagging, by a tagger that can work in a streaming manner (e.g., a neural network), the stream of tokens with one or more tags of a plurality of tags to produce a tagged stream of tokens, each tag of the plurality of tags representing a different normalization category of a plurality of normalization categories; based on at least a first tag representing a first normalization category, converting, by a first language converter of a plurality of category-specific natural language converters (e.g., weighted finite state transducers, WFSTs), at least one token of the tagged stream of tokens, from a first lexical language form, to a first natural language form; and based on at least the first natural language form, outputting a natural language representation of the stream of tokens.


Find Patent Forward Citations

Loading…