The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Dec. 20, 2016
Filed:
Dec. 13, 2012
Peking University Founder Group Co., Ltd., Beijing, CN;
Peking University, Beijing, CN;
Beijing Founder Electronics Co., Ltd., Beijing, CN;
Xinli Wu, Beijing, CN;
Jianwu Yang, Beijing, CN;
PEKING UNIVERSITY FOUNDER GROUP CO., LTD., Beijing, CN;
PEKING UNIVERSITY, Beijing, CN;
BEIJING FOUNDER ELECTRONICS CO., LTD., Beijing, CN;
Abstract
The invention discloses a method of collecting network data. This method is applicable to collection of data of network documents, published on a website, related respectively to M subjects, wherein M is a positive integer, the method including: configuring webpage link addresses, of network data to be collected, into queues of corresponding types according to types corresponding to the webpage link addresses of the network data to be collected, wherein the webpage link addresses of the network data to be collected are link addresses of webpages where the data of the network documents related respectively to the M subjects are located; obtaining webpage source codes corresponding to the webpage link addresses, of the network data to be collected, in the queues of the corresponding types; and extracting the data of the network documents corresponding to URLs corresponding to the webpage source codes according to the URL information and collection depth values of the URLs.