The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jul. 29, 2025

Filed:

Jun. 12, 2020
Applicant:

Microsoft Technology Licensing, Llc, Redmond, WA (US);

Inventors:

Lei Cui, Beijing, CN;

Shaohan Huang, Beijing, CN;

Li Dong, Beijing, CN;

Furu Wei, Beijing, CN;

Assignee:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06V 30/262 (2022.01); G06F 40/30 (2020.01); G06V 10/50 (2022.01); G06V 10/82 (2022.01); G06V 30/14 (2022.01); G06V 30/19 (2022.01); G06V 30/412 (2022.01);
U.S. Cl.
CPC ...
G06V 30/262 (2022.01); G06F 40/30 (2020.01); G06V 10/50 (2022.01); G06V 10/82 (2022.01); G06V 30/1448 (2022.01); G06V 30/19147 (2022.01); G06V 30/412 (2022.01);
Abstract

There is provided a solution for semantic representation of text in a document. In this solution, textual information comprising a sequence of text elements () and layout information () of the text element are determined from a document. The layout information () indicates a spatial arrangement of the plurality of text elements () presented within the document. Based at least in part on the plurality of text elements () and the layout information (), respective semantic feature representations () of the plurality of text elements () are generated. By jointly using both the textual information and the layout information (), rich semantics of the text elements () in the document can be effectively captured in the feature representations.


Find Patent Forward Citations

Loading…