The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Aug. 22, 2023
Filed:
Jun. 19, 2020
Microsoft Technology Licensing, Llc, Redmond, WA (US);
Charumathi Lakshmanan, Bellevue, WA (US);
Ye Li, Redmond, WA (US);
Arnold Overwijk, Redmond, WA (US);
Chenyan Xiong, Bellevue, WA (US);
Jiguang Shen, Bellevue, WA (US);
Junaid Ahmed, Bellevue, WA (US);
Jiaming Guo, Kirkland, WA (US);
MICRSOFT TECHNOLOGY LICENSING, LLC, Redmond, WA (US);
Abstract
To provide automated categorization of structured textual content individual nodes of textual content, from a document object model encapsulation of the structured textual content, have a multidimensional vector associated with them, where the values of the various dimensions of the multidimensional vector are based on the textual content in the corresponding node, the visual features applied or associated with the textual content of the corresponding node, and positional information of the textual content of the corresponding node. The multidimensional vectors are input to a neighbor-imbuing neural network. The enhanced multidimensional vectors output by the neighbor-imbuing neural network are then be provided to a categorization neural network. The resulting output can be in the form of multidimensional vectors whose dimensionality is proportional to categories into which the structured textual content is to be categorized. A weighted merge takes into account multiple nodes that are grouped together.