NVIDIA is so far ahead of the competition that people really do not care about the poor API that ROCm offers. Intel is not really a player worth caring about on the GPU side, though they obviously still matter on the CPU end. All the other players matter even less.
I badly wanted a decent alternative to CUDA for quite a while, but even now, after leaving AI myself, it is clear the competition is nowhere near the standard NVIDIA has been setting for close to a decade.
We don't even have to get into all the issues consumers have had with AMD's GPUs.
I'm not deeply familiar with this space, but aren't training and inference on CPUs gaining some traction[1][2][3]? They're surely still orders of magnitude slower, but as CPUs get more powerful and ML frameworks get more efficient, making this cheaper and more accessible would be a major breakthrough.
CPU training was, and continues to be, roughly one-tenth the speed of GPU training, dollar for dollar. It's not even close.
The inescapable trend is that bigger models are better models. So if you're doing this professionally, you're going to need GPUs not only for training, but increasingly for inference in order to get decent latency.
On the other hand, it's likely the competition will catch up to Nvidia.
They were early and good. But the space is too lucrative, and the technology is not special enough, for them to keep their dominant position for long.
Nor does anything Khronos-related for compute APIs.
While Apple initially promoted OpenCL, Android never supported it as an official API; the official option there was RenderScript instead.
So on iOS Apple eventually moved to Metal, usable from Objective-C and Swift, while on Android Google has deprecated RenderScript (usable from Java and Kotlin), pushing everyone to learn Vulkan compute and do the integration in the NDK by themselves, picking up C, C++ and GLSL in the process (no big deal from their POV).
With such a great usability story, it's no wonder basically nobody cares. /s
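To give a sense of the bar being set here: even the most trivial Vulkan compute dispatch needs a hand-written GLSL shader along these lines (binding number and workgroup size below are just illustrative), on top of all the Vulkan instance/pipeline/descriptor boilerplate you have to write yourself in C or C++ on the NDK side.

```glsl
#version 450
// "Hello world" of Vulkan compute: double every element of a storage
// buffer. Workgroup size and binding index are illustrative choices.
layout(local_size_x = 64) in;

layout(set = 0, binding = 0) buffer Data {
    float values[];
};

void main() {
    uint i = gl_GlobalInvocationID.x;
    // Guard against the last workgroup running past the buffer end.
    if (i < values.length()) {
        values[i] = values[i] * 2.0;
    }
}
```

And this still has to be compiled to SPIR-V and wired up through pipeline and descriptor-set setup on the native side before a single dispatch runs; compare that to writing a RenderScript kernel callable straight from Java/Kotlin.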
> Nor does anything Khronos related for compute APIs
As you said below, Vulkan compute does work on mobile devices, and there are a bunch of higher-level libraries built on it (though, SYCL aside, not from Khronos).
I don't think Apple is the only mobile vendor. I think that between AMD, Intel, Qualcomm, MediaTek, Samsung and Google, they will realize that they are all being screwed over by Nvidia. Right now ML is still mostly research, but when real products start surfacing (or, perhaps, for real products to surface to begin with) they will have to settle on something, whether it's ROCm, oneAPI or something else.
You mean aside from their long history of developing open things, from WebKit (based on KDE's open KHTML and made into a full-blown browser engine) to Swift?