The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:
Sep. 23, 2025

Filed:
Jul. 26, 2022

Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Ang Yi, Beijing, CN;

Jing Zhang, Beijing, CN;

Hai Cheng Wang, Beijing, CN;

Jun Hong Zhao, ShangDi, CN;

Rajesh M. Desai, San Jose, CA (US);

Yang Zhong Li, Beijing, CN;

Xue Xu, Beijing, CN;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06V 30/148 (2022.01); G06V 30/18 (2022.01);
U.S. Cl.
CPC ...
G06V 30/153 (2022.01); G06V 30/18181 (2022.01);
Abstract

A computer-implemented method for text block segmentation includes determining a first text block segmentation pattern utilized to generate a segmented text block based, at least in part, on a comparison of semantic information associated with the segmented text block and a plurality of predefined types of text block segmentation patterns indicated by a graph; calculating a first degree of confidence in a size of the segmented text block based, at least in part, on comparing semantic entities associated with the segmented text block with semantic entities indicated by leaf nodes stemming from a first non-leaf node included in the graph and representative of the first type of text block segmentation pattern; and determining that the size of the segmented text block is non-optimal based on the calculated degree of confidence in the size of the segmented text block being below a predetermined threshold.
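The three steps in the abstract (pick a segmentation pattern from a graph, score confidence in the block's size, flag the size as non-optimal below a threshold) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the graph is reduced to a dictionary mapping each non-leaf node (a pattern type) to the semantic entities on its leaf nodes, the comparison is a Jaccard overlap, and the names `PATTERN_GRAPH`, `determine_pattern`, `size_confidence`, `is_size_non_optimal`, and the threshold of 0.6 are hypothetical.

```python
from typing import Dict, Set

# Hypothetical graph: each key is a non-leaf node (a segmentation pattern type),
# and its value is the set of semantic entities on the leaf nodes below it.
PATTERN_GRAPH: Dict[str, Set[str]] = {
    "address": {"street", "city", "postal_code", "country"},
    "invoice_line": {"item", "quantity", "unit_price", "total"},
}


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Overlap of two entity sets (assumed similarity measure, not from the patent)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def determine_pattern(block_entities: Set[str], graph: Dict[str, Set[str]]) -> str:
    """Pick the pattern type whose leaf entities best match the block's entities."""
    # sorted() makes tie-breaking deterministic (alphabetical order).
    return max(sorted(graph), key=lambda name: jaccard(block_entities, graph[name]))


def size_confidence(block_entities: Set[str], pattern: str,
                    graph: Dict[str, Set[str]]) -> float:
    """Confidence in the block's size: how fully it covers the pattern's leaf entities."""
    return jaccard(block_entities, graph[pattern])


def is_size_non_optimal(block_entities: Set[str], graph: Dict[str, Set[str]],
                        threshold: float = 0.6) -> bool:
    """Flag the block when confidence falls below the (assumed) threshold."""
    pattern = determine_pattern(block_entities, graph)
    return size_confidence(block_entities, pattern, graph) < threshold


# A block holding only half of an address is flagged; a complete one is not.
too_small = {"street", "city"}
complete = {"street", "city", "postal_code", "country"}
print(is_size_non_optimal(too_small, PATTERN_GRAPH))
print(is_size_non_optimal(complete, PATTERN_GRAPH))
```

In this sketch a block that is cut too short misses expected entities and a block that is too large picks up entities from other patterns, so both cases lower the overlap score. A real system would presumably derive the entities from semantic analysis of the text rather than from hand-written sets.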

