Gemma 3 Benchmark
gemma-webgpu (custom WGSL shaders) vs transformers.js (ONNX Runtime WebGPU)
gemma-webgpu
Load time—
Time to first token—
Generation speed—
Total tokens—
Total time—
transformers.js
Load time—
Time to first token—
Generation speed—
Total tokens—
Total time—
gemma-webgpu output
transformers.js output