Wrong values using webgpu on q8
#18
by alien79 - opened
I've tried the q8 version and I've seen that when usingwebgpu device, result is different than wasm
this doesn't happens with fp32
is it a known limitation? what's the problem?
(I didn't tested other quantized versions)