The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Jun. 23, 2015
Filed:
Sep. 24, 2009
Manoj K. Agarwal, Noida, IN;
Manish A. Bhide, Vasant Kunj, IN;
Srilakshmi Kotwal, Hyderabad AP, IN;
Srinivas Kiran Mittapalli, Secunderabad, IN;
Sriram Padmanabhan, San Jose, CA (US);
Manoj K. Agarwal, Noida, IN;
Manish A. Bhide, Vasant Kunj, IN;
Srilakshmi Kotwal, Hyderabad AP, IN;
Srinivas Kiran Mittapalli, Secunderabad, IN;
Sriram Padmanabhan, San Jose, CA (US);
International Business Machines Corporation, Armonk, NY (US);
Abstract
Techniques for running an Extract Transform Load (ETL) job in parallel on one or more processors wherein the ETL job comprises use of an extensible markup language (XML) document are provided. The techniques include receiving an XML document input, identifying a node in the XML document at which partitioning of the XML document is to begin, sending partition information to each respective processor, performing a shallow parsing of the XML document in parallel on the one or more processors, wherein each processor performs shallow parsing using the identified partition node until it reaches its identified partition, using the shallow parsing to generate the partition of the input XML document, wherein each processor generates a different partition of the same XML document, and sending each partition in streaming format to an ETL job instance.