Yesterday NVIDIA announced the newest release of the CUDA Toolkit, version 3.2. It not only offers some new CUDA libraries for things like sparse matrix work and random number generation (important for cryptography), but also a nice performance boost of up to 300% in the older CUDA BLAS library. They’ve also added H.264 encode and decode to the mix, along with some cluster management tools for folks running bigger GPGPU setups.

If you want to know more, they are also hosting a free webinar on Tuesday, November 23rd at 10am PT to discuss it. Get all the details in the press release after the break.

NVIDIA Announces CUDA Toolkit 3.2, Delivering up to 300% Performance Increase

CUDA Toolkit 3.2 Production Release Features New and Improved Math Libraries
and Up To 30X Faster Performance than Latest MKL

Santa Clara, CA – November 17, 2010 – Today NVIDIA announced the availability of the CUDA Toolkit 3.2 production release, which provides significant performance increases, new math libraries and advanced cluster management features for developers creating next-generation GPU-accelerated applications.

The CUDA Toolkit includes all the tools, libraries and documentation developers need to build CUDA C/C++ applications, and is the foundation for many other GPU computing language solutions.  New features and significant performance enhancements in version 3.2 include:

  • Up to 300-percent performance improvement in CUDA BLAS (CUBLAS) library routines, delivering 8 times faster performance than the latest Intel MKL (Math Kernel Library)
  • CUDA FFT (CUFFT) library optimizations delivering 2-20 times faster performance than the latest MKL
  • New CURAND library for random number generation that runs 10-20 times faster than the latest MKL
  • New CUSPARSE library of sparse matrix routines that delivers 6-30 times faster performance than the latest MKL
  • A host of additional improvements to GPU debugging and performance analysis tools

In addition, the new CUDA Toolkit 3.2 release includes H.264 encode/decode, new Tesla Compute Cluster (TCC) integration, cluster management features, and support for the new 6GB NVIDIA Tesla and Quadro GPU products.

NVIDIA is hosting a webinar on Tuesday, Nov. 23 at 10:00 a.m. PT to review the new performance enhancements and capabilities of the new CUDA Toolkit.  To register for the webinar, please visit:  https://www2.gotomeeting.com/register/887428835.

For more release highlights and the latest CUDA Toolkit downloads, please visit: www.nvidia.com/getcuda.

About NVIDIA
NVIDIA (NASDAQ:NVDA) awakened the world to the power of computer graphics when it invented the GPU in 1999. Since then, it has consistently set new standards in visual computing with breathtaking, interactive graphics available on devices ranging from tablets and portable media players to notebooks and workstations. NVIDIA’s expertise in programmable GPUs has led to breakthroughs in parallel processing which make supercomputing inexpensive and widely accessible. The Company holds more than 1,600 patents worldwide, including ones covering designs and insights that are essential to modern computing.  For more information, see www.nvidia.com.