The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Jan. 09, 2024
Filed:
Jun. 16, 2022
Tata Consultancy Services Limited, Mumbai, IN;
Arushi Jain, Noida, IN;
Shubham Paliwal, Noida, IN;
Monika Sharma, Noida, IN;
Lovekesh Vig, Noida, IN;
TATA CONSULTANCY SERVICES LIMITED, Mumbai, IN;
Abstract
State of art techniques that utilize spatial association based Table structure Recognition (TSR) have limitation in selecting minimal but most informative word pairs to generate digital table representation. Embodiments herein provide a method and system for TSR from an table image via deep spatial association of words using optimal number of word pairs, analyzed by a single classifier to determine word association. The optimal number of word pairs are identified by utilizing immediate left neighbors and immediate top neighbors approach followed redundant word pair elimination, thus enabling accurate capture of structural feature of even complex table images via minimal word pairs. The reduced number of word pairs in combination with the single classifier trained to determine the word associations into classes comprising as same cell, same row, same column and unrelated, provides TSR pipeline with reduced computational complexity, consuming less resources still generating more accurate digital representation of complex tables.