The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jan. 21, 2025

Filed:

Sep. 23, 2021
Applicant:

Educational Testing Service, Princeton, NJ (US);

Inventors:

Swapna Somasundaran, Plainsboro, NJ (US);

Goran Glavaš, Heidelberg, DE;

Assignee:

Educational Testing Service, Princeton, NJ (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 40/284 (2020.01); G06F 18/214 (2023.01); G06N 3/04 (2023.01); G06N 3/08 (2023.01);
U.S. Cl.
CPC ...
G06F 40/284 (2020.01); G06F 18/214 (2023.01); G06N 3/04 (2013.01); G06N 3/08 (2013.01);
Abstract

Data such as unstructured text is received that includes a sequence of sentences. This received data is then tokenized into a plurality of tokens. The received data is segmented using a hierarchical transformer network model including a token transformer, a sentence transformer, and a segmentation classifier. The token transformer contextualizes tokens within sentences and yields sentence embeddings. The sentences transformer contextualizes sentence representations based on the sentence embedddings. The segmentation classifier predicts segments of the received data based on the contextualized sentence representations. Data can be provided which characterizes the segmentation of the received data. Related apparatus, systems, techniques and articles are also described.


Find Patent Forward Citations

Loading…