Depends if you’re on the giving or receiving end Some developers might disagree.
Besides that, the new Tesla K20 offers a lot of benefits with cuda, like 32 MPI tasks, or that it can spawn it’s own new kernels, thus allowing recursion.
However, it’s AMDs big chance to get a foot in the door of professional studios again if they play it smart. All they have to do is polish their OpenCL drivers and stack VRAM in the 2 digits on their cards.
While the current Quadros leave the latest FirePro dead in the water in viewport performance, the GPGPU sector tells a different story.
For double precision, the professional cards from Nvidia win the race each and every time, but that’s only needed in scientific calculations we CG artists don’t care about.
In single precision for instance, in LuxMark 2.0, a Tesla K20 for 3300 Euro with 5GB VRAM scores 238 samples/sec.
A FirePro9000 for 3100 Euro with 6GB VRAM scores 1914 samples/sec.
Xeon Phi is interesting too, especially for developers. If you run it in offload mode, you just need a #pragma offload target (mic) directive and the compiler does the rest to feed the coprocessor.
It remains to see what’ll happen in the near future. The problem currently is that the hardware manufacturers are cooking faster than the developers can eat.
One might bet on CUDA the other one on OpenCL (which became as religious already as Direct3D vs. OpenGL) to develop something within one or two years to find out at the end of the period that it was a waste of time, because now XeonPhi is the ring to rule them all. And while reading into it, the next manufacturer builds the AwesomeTrace9000 card…