The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Mar. 21, 2023

Filed:

Sep. 26, 2018
Applicant:

Microsoft Technology Licensing, Llc, Redmond, WA (US);

Inventors:

Sebastian Alexander Csar, New York, NY (US);

Uri Merhav, Rehovot, IL;

Dan Shacham, Sunnyvale, CA (US);

Assignee:
Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06N 3/08 (2023.01); G06Q 10/06 (2023.01); G06F 16/28 (2019.01); G06N 3/04 (2023.01); G06Q 10/0631 (2023.01);
U.S. Cl.
CPC ...
G06N 3/08 (2013.01); G06F 16/285 (2019.01); G06N 3/04 (2013.01); G06Q 10/063112 (2013.01);
Abstract

In an example embodiment, a system is provided whereby a machine learning model is trained to predict a standardization for a given raw title. A neural network may be trained whose input is a raw title (such as a query string) and a list of candidate titles (either title identifications in a taxonomy, or English strings), which produces a probability that the raw title and each candidate belong to the same title. The model is able to standardize titles in any language included in the training data without first having to perform language identification or normalization of the title. Additionally, the model is able to benefit from the existence of 'loan words' (words adopted from a foreign language with little or no modification) and relations between languages.


Find Patent Forward Citations

Loading…