The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jun. 17, 2025

Filed:

Sep. 22, 2023
Applicant:

Open Text SA Ulc, Halifax, CA;

Inventors:

Martin Brousseau, Mont-Saint-Hilaire, CA;

Steve Pettigrew, Montreal, CA;

Assignee:

OPEN TEXT SA ULC, Halifax, CA;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 17/00 (2019.01); G06F 16/9535 (2019.01); G06F 16/955 (2019.01); G06F 40/295 (2020.01); G06F 40/30 (2020.01);
U.S. Cl.
CPC ...
G06F 16/9535 (2019.01); G06F 16/9558 (2019.01); G06F 40/295 (2020.01); G06F 40/30 (2020.01);
Abstract

A source content processor receives content from a crawler and calls a text mining engine. The text mining engine mines the content and provides metadata about the content. The source content processor applies a source content filtering rule to the content utilizing the metadata from the text mining engine. The source content filtering rule is previously built based on at least one of a named entity, a category, or a sentiment. The source content processor determines whether to persist the content according to a result from applying the source content filtering rule to the content and either stores the content in a data store or deletes the contents from the data ingestion pipeline such that the content is not persisted anywhere. Embodiments disclosed herein can significantly reduce the amount of irrelevant content through the data ingestion pipeline, prior to data persistence.


Find Patent Forward Citations

Loading…