Table 4

Inconsistency and intrinsic difficulty of different diffusion-based generative AI models. Classification error rates obtained with the Dual Vision Transformer (DaViT) model.

| Testing / Training | FLUX.1-schnell | Stable Diffusion 3 | SD XL Turbo | Wuerstchen | Kandinsky 2.2 |
|---|---|---|---|---|---|
| FLUX.1-schnell | 1.43% | 5.09% | 16.08% | 7.09% | 5.28% |
| Stable Diffusion 3 | 6.88% | 2.35% | 11.48% | 5.19% | 5.64% |
| SD XL Turbo | 16.19% | 25.47% | 0.93% | 18.03% | 17.68% |
| Wuerstchen | 12.50% | 14.41% | 10.41% | 0.98% | 13.11% |
| Kandinsky 2.2 | 10.70% | 10.64% | 19.64% | 8.79% | 1.78% |
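
As a reading aid, the sketch below compares the diagonal of the table (classifier trained and tested on images from the same generator) with the off-diagonal cells (trained on one generator, tested on another). The values are copied from Table 4. The names `MODELS`, `ERROR`, `diagonal_mean` and `off_diagonal_mean` are illustrative, and these two averages are only a rough summary assumed here for illustration, not necessarily how the inconsistency or intrinsic difficulty measures are defined.

```python
# Error rates (%) copied from Table 4, in the table's own row/column order.
MODELS = ["FLUX.1-schnell", "Stable Diffusion 3", "SD XL Turbo",
          "Wuerstchen", "Kandinsky 2.2"]
ERROR = [
    [1.43, 5.09, 16.08, 7.09, 5.28],
    [6.88, 2.35, 11.48, 5.19, 5.64],
    [16.19, 25.47, 0.93, 18.03, 17.68],
    [12.50, 14.41, 10.41, 0.98, 13.11],
    [10.70, 10.64, 19.64, 8.79, 1.78],
]


def diagonal_mean(matrix):
    """Average error when the training and testing generators match."""
    n = len(matrix)
    return sum(matrix[i][i] for i in range(n)) / n


def off_diagonal_mean(matrix):
    """Average error when the training and testing generators differ."""
    n = len(matrix)
    cells = [matrix[i][j] for i in range(n) for j in range(n) if i != j]
    return sum(cells) / len(cells)


if __name__ == "__main__":
    print(f"same-generator mean error:  {diagonal_mean(ERROR):.2f}%")
    print(f"cross-generator mean error: {off_diagonal_mean(ERROR):.2f}%")
```

Because both averages treat rows and columns symmetrically, they do not depend on which axis of the table is the training set and which is the testing set.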
