They also have Gaudi which is a seperate architecture from the GPU/Xe stuff, and is already somewhat decent and holds a lot of promise IMO. Hopefully there is more competition in both the GPU space and the AI space, and it seems like Intel will have offerings for both.
We anticipate that with further optimization, Gaudi 2 will soon outperform A100s on this model. In earlier tests on our SDXL model with base PyTorch, Gaudi 2 generates a 1024x1024 image in 30 steps in 3.2 seconds, versus 3.6 seconds for PyTorch on A100s and 2.7 seconds for a generation with TensorRT on an A100.
The higher memory and fast interconnect of Gaudi 2, plus other design considerations, make it competitive to run the Diffusion Transformer architecture that underpins this next generation of media models.
I do agree on them doing a LOT for open source, but I am also wary because of their history like their compiler which they were court mandated to say may not perform as well on AMD processors, amongst many other things. Honestly though NVIDIA sets the bar very low with just how greedy they are ( case in point, the whole VDI, vGPU licensing situation etc.)
