The patent badge is an abbreviated version of the USPTO patent document and contains a link to the full patent document.

Int. Cl.

G06F 40/30 (2020.01); G06F 40/284 (2020.01); G06F 40/211 (2020.01); G06K 9/62 (2022.01); G10L 15/08 (2006.01); G06N 3/08 (2023.01);

U.S. Cl.

CPC ...

G06F 40/30 (2020.01); G06F 40/211 (2020.01); G06F 40/284 (2020.01); G06K 9/6256 (2013.01); G06K 9/6263 (2013.01); G06N 3/08 (2013.01); G10L 2015/088 (2013.01);

Abstract

A system for extracting a key phrase from a document includes a neural key phrase extraction model ('BLING-KPE') having a first layer to extract a word sequence from the document, a second layer to represent each word in the word sequence by ELMo embedding, position embedding, and visual features, and a third layer to concatenate the ELMo embedding, the position embedding, and the visual features to produce hybrid word embeddings. A convolutional transformer models the hybrid word embeddings to n-gram embeddings, and a feedforward layer converts the n-gram embeddings into a probability distribution over a set of n-grams and calculates a key phrase score of each n-gram. The neural key phrase extraction model is trained on annotated data based on a labeled loss function to compute cross entropy loss of the key phrase score of each n-gram as compared with a label from the annotated dataset.
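
The data flow described in the abstract can be illustrated with a minimal sketch in plain Python. It is not the patented implementation. The vector sizes, the scalar convolution kernel, the linear scoring weights, and the softmax normalization are all assumptions made for illustration, and the transformer step is omitted entirely. All function names are hypothetical.

```python
import math
import random


def hybrid_embeddings(elmo, position, visual):
    """Concatenate the three per-word feature vectors into one hybrid embedding."""
    return [e + p + v for e, p, v in zip(elmo, position, visual)]


def ngram_embeddings(words, n, kernel):
    """Toy 1-D convolution: weighted sum over each window of n word vectors.

    kernel[k] is a scalar weight for the k-th word in the window. This is an
    assumption; a learned convolution would use a weight matrix per position.
    """
    dim = len(words[0])
    return [
        [sum(kernel[k] * words[i + k][d] for k in range(n)) for d in range(dim)]
        for i in range(len(words) - n + 1)
    ]


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [x / total for x in exps]


def keyphrase_probs(ngrams, weights):
    """Linear scoring layer followed by softmax over all candidate n-grams."""
    scores = [sum(w * x for w, x in zip(weights, g)) for g in ngrams]
    return softmax(scores)


def cross_entropy(probs, labels):
    """labels[i] is 1 if n-gram i is an annotated key phrase, else 0."""
    return -sum(y * math.log(p) for p, y in zip(probs, labels))


# Made-up inputs: 5 words, 4-dim "ELMo" vectors, 1-dim position, 2-dim visual.
random.seed(0)
num_words = 5
elmo = [[random.uniform(-1, 1) for _ in range(4)] for _ in range(num_words)]
position = [[i / num_words] for i in range(num_words)]
visual = [[1.0, 0.0] for _ in range(num_words)]

words = hybrid_embeddings(elmo, position, visual)          # 5 vectors of dim 7
bigrams = ngram_embeddings(words, n=2, kernel=[0.5, 0.5])  # 4 candidate bigrams
weights = [random.uniform(-1, 1) for _ in range(7)]        # untrained scorer
probs = keyphrase_probs(bigrams, weights)
loss = cross_entropy(probs, [0, 1, 0, 0])                  # second bigram labeled
```

In training, the loss would be minimized with respect to the kernel and scoring weights. Here they are fixed random values, so the sketch only shows how the shapes and quantities relate.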
