The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Sep. 24, 2019

Filed:

Mar. 30, 2016
Applicant:

Intuit Inc., Mountain View, CA (US);

Inventors:

Soumendra Daas, Bangalore, IN;

Nanjangud C. Narendra, Bangalore, IN;

Sekar Udayamurthy, Bangalore, IN;

Assignee:

Intuit Inc., Mountain View, CA (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G06F 16/00 (2019.01); G06F 16/951 (2019.01); H04L 29/08 (2006.01); G06F 17/24 (2006.01);
U.S. Cl.
CPC ...
G06F 16/951 (2019.01); G06F 17/248 (2013.01); H04L 67/02 (2013.01);
Abstract

An automated extensible scraping script is generated for web scraping that is extensible to a plurality of domains. Web sites are classified based on common extracted domain data, further clustering the data based on common navigation structures, and using such commonalities to automate the generation of scraping code based on predefined and reusable code snippets for specific parts of the web sites. Scraping services include a mapper module and a script generator module. Building blocks include a data model updater, a navigation model generator and a navigation model matcher. An administrative module includes domain clustering and configuration file maintenance.


Find Patent Forward Citations

Loading…