Learn more about cloning repositories
You have read-only access
[js/webgpu] Optimize Expand (#22752) Use components = 4 if possible. llama3.2-1B becomes 20 tokens/s from 18 tokens/s on my iGPUs.