The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Nov. 24, 2009

Filed:

Apr. 26, 2006
Applicants:

Scott A. Thieret, Nashua, NH (US);

Detlev Stalling, Berlin, DE;

Malte Westerhoff, Berlin, DE;

Inventors:

Scott A. Thieret, Nashua, NH (US);

Detlev Stalling, Berlin, DE;

Malte Westerhoff, Berlin, DE;

Assignee:

Mercury Computer Systems, Inc., Chelmsford, MA (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G06K 9/64 (2006.01);
U.S. Cl.
CPC ...
Abstract

The invention provides, in some aspects, methods and apparatus for signal and/or image processing which perform convolution-based filtering utilizing a graphics processing unit (GPU, also called 'graphics card') to compute multiple output pixels at once. This has the advantage of saving memory bandwidth, while leveraging the GPUs vector multiplication and dot product units during the calculation. Related aspects of the invention provide such methods and apparatus in which multiple output pixels are computed simultaneously by using render targets with more than one channel, e.g., an RGBA render target, or multiple render targets, or a combination thereof. By way of non-limiting example, methods and apparatus according to the invention implement convolution on a GPU by executing the steps of defining input image I(x,y) as input texture of size N×N; defining an RGBA render target (output) of size N/4×N; and, for each RGBA output pixel aggregating o(x,y) by (i) reading all input pixels I(x*4+i,y), with i=−4,0,4, and computing o(x,y) for the all four components of the output tuple.


Find Patent Forward Citations

Loading…