ARM Mali-T604 and GPU Computing in Android?
Google’s newest mobile OS “Android 4.0″ has lots of improvements to the UI and guts of the system. However, one thing many people don’t know that the new OS combined with newer ARM systems enables one additional exciting feature: GPU computing with the RenderScript API. Alone that’s impressive, but combined with some of the unique hardware features it could really prove amazing. Check out the new memory and cache system supported on the new Mali-T604 (Rumored to be the guts of Samsung’s upcoming products):
The ARM Mali-T604 GPU is designed to work with the latest version (4) of the AMBA (Advanced Microcontroller Bus Architecture) which features Cache Coherent Interconnect (CCI). Data shared between processors in the system, a natural occurrence in heterogeneous computing, no longer requires costly (in terms of cycles and energy) synchronization via external memory and explicit cache maintenance operations. All of this is now performed in hardware, and is enabled transparently inside the drivers. In addition to reduced memory traffic, CCI avoids superfluous sharing of data: only data genuinely requested by another master is transferred to it, to the granularity of a cache line. No need to flush a whole buffer or data structure anymore.
These memory flush’es are one of the worst things of modern GPU & GPGPU systems: One little branch conditional can destroy your performance. In addition, every time you have to flush your data back to main memory, or load memory into the GPU, that’s a lengthy and performance-killing operation if done often. These new unified designs have the potential to nullify the impact of these operations, making GPU programming closer to CPU programming than ever before.
via GPU Computing in Android? With ARM Mali-T604 & RenderScript Compute You Can! – ARM Community.

The fifth annual GPGPU Workshop will be held in London this March alongside
AMD is pushing into the GPU-compute space hard with systems like Fusion, and has now managed to get their FirePro discrete card certified for OpenCL acceleration of the Abaqus Finite Element solver.
In a press event yesterday, AMD announced the next generation Fusion chips they are working on, and a tiny little note on a one slide mentioned a new tool from Microsoft called AMP. Over on the NVidia blog, they give a few more details about it: It’s a new GPU Programming tool from Microsoft.
AMD has a new press release out touting more OpenCL offerings, but includes a nice list of OpenCL applications. It’s not as extensive as NVidia’s CUDA lists, but has some big names like ArcSoft, Corel, Sony Vegas Pro, and Rovi.
So, for the last week or so the internet has been abuzz with stories about “BitCoin”, the new all-digital currency that’s going to destabilize governments around the world and bring us to a new utopian society. Well, yeah it’s a lot of hype. But when I heard about the “mining” aspect of it, and how it’s almost entirely GPU based, I figured I would check it out.


Comments