The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).
Patent No.:
Date of Patent:
Jan. 06, 2009
Filed:
Mar. 29, 2006
David L. Blackman, Rego Park, NY (US);
Michael Ching, San Jose, CA (US);
Stephen Dill, San Jose, CA (US);
Ivan Eduardo Gonzalez, Pittsburgh, PA (US);
Adam Marcus, Stamford, CT (US);
Daniel Norin Meredith, Sunnyvale, CA (US);
Linda Anh Linh Nguyen, San Jose, CA (US);
International Business Machines Corporation, Armonk, NY (US);
Abstract
A system and method for prioritizing a fetch order of web pages. The method comprises extracting, by a web crawler, a set of candidate web pages to be crawled. Each web page in the set of candidate web pages is associated with a website in a computer network. A determination is made as to whether a first website score for the website is in a website score database. The first website score is associated with web pages in the set of candidate web pages if the first website score exists in the website score database. The set of candidate web pages is prioritized with respect to an associated website score for each web page in the set of candidate web pages. Content is retrieved from the set of candidate web pages. Hyperlinks are extracted from the content. The hyperlinks are stored in a memory unit.
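
The flow described in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation. It assumes that a website is identified by the host part of the URL, that the website score database is a plain dictionary, that a higher score means an earlier fetch, and that pages whose website has no stored score receive a default of 0.0 (the abstract does not say how such pages are handled). The names prioritize, crawl_once, LinkExtractor, and the fetch callable are hypothetical.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse

DEFAULT_SCORE = 0.0  # assumption: used when a website has no stored score


def prioritize(candidate_urls, website_scores, default_score=DEFAULT_SCORE):
    """Order candidate URLs from highest to lowest website score.

    website_scores stands in for the website score database (host -> score).
    Ties keep the original candidate order.
    """
    scored = []
    for position, url in enumerate(candidate_urls):
        site = urlparse(url).netloc  # assumption: website == URL host
        score = website_scores.get(site, default_score)
        scored.append((-score, position, url))
    scored.sort()
    return [url for _, _, url in scored]


class LinkExtractor(HTMLParser):
    """Collect absolute hyperlink targets from <a href="..."> tags."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(urljoin(self.base_url, value))


def crawl_once(candidate_urls, website_scores, fetch):
    """Fetch candidates in priority order and return the extracted links.

    fetch is a caller-supplied function (hypothetical) mapping URL -> HTML text.
    The returned list plays the role of the memory unit in the abstract.
    """
    stored_links = []
    for url in prioritize(candidate_urls, website_scores):
        parser = LinkExtractor(url)
        parser.feed(fetch(url))
        stored_links.extend(parser.links)
    return stored_links
```

Passing fetch in as a parameter keeps the sketch independent of any particular HTTP client, and lets the ordering logic be exercised with canned HTML instead of live network requests.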