The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Sep. 03, 2024

Filed:

Sep. 08, 2022
Applicant:

Rapid7, Inc., Boston, MA (US);

Inventors:

Viliam Holub, Prague, CZ;

Eoin Shanley, Dublin, IE;

Trevor Parsons, Boston, MA (US);

Assignee:

Rapid7, Inc., Boston, MA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 16/24 (2019.01); G06F 16/2455 (2019.01); G06F 16/2458 (2019.01); G06N 5/022 (2023.01); G06N 5/04 (2023.01); G06N 20/00 (2019.01);
U.S. Cl.
CPC ...
G06F 16/2462 (2019.01); G06F 16/24553 (2019.01); G06N 5/022 (2013.01); G06N 5/04 (2013.01); G06N 20/00 (2019.01);
Abstract

Systems and methods are disclosed for an approximate string searching technique to search for match results that have character differences with the search string. A cost is computed to measure the amount of character differences, and a match is recognized if the cost is below a threshold. The match is determined based on an inferred state machine, whose states are iteratively generated in computer memory for successive characters in the input text. States are added to represent modifications to the string needed to account for character differences and track the costs of the modifications. States are removed when their costs become excessive. Advantageously, the search process never generates the full state machine in memory, retaining only a selected set of best states to continue with the approximate match process. The technique thus enables a practicable implementation of approximate searching that can tolerate an arbitrary number of character deviations.


Find Patent Forward Citations

Loading…