The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Aug. 02, 2022
Filed:
Mar. 19, 2020
Hong Kong Applied Science and Technology Research Institute Company Limited, Hong Kong, HK;
Wai Kai Arvin Tang, Hong Kong, HK;
Jinning Yang, Shenzhen, CN;
Abstract
A method for extracting information from a table includes steps as follows. Characters of a table are extracted. The characters are merged into n-gram characters. The n-gram characters are merged into words and text lines through a two-stage GNN mode. The two-stage GNN mode comprises sub steps as: spatial features, semantic features, CNN image features are extracted from a target source; a first GNN stage is processed to output graph embedding spatial features from the spatial features; and a second GNN stage is processed to output graph embedding semantic features and graph embedding CNN image features from the semantic features and the CNN image features, respectively. The text lines are merged into cells. The cells are grouped into rows, columns, and key-value pairs based on one or more adjacency matrices, a row relationship among the cells, a column relationship among the cells, and a key-value relationship among the cells.