I tried running the same "Generate an SVG of a pelican riding a bicycle" prompt against 21 different quantized variants of the same IBM Granite 4.1 3B model - the results weren't as interesting as I had hoped https://t.co/rBvko3ZISM
Simon Willison Benchmarks IBM Granite 4.1 Quantizations for Spatial Reasoning
Simon WillisonSimon Willison tested 21 quantized variants of IBM's new Granite 4.1 3B model to see if model size correlates with SVG generation quality. His findings show no distinguishable pattern between quantization level and performance, with even the smallest variants occasionally outperforming larger ones.
GGUF variants (compressed model formats) of the IBM Granite 4.1 3B model released by Unsloth, a model optimization team. Using an SVG generation prompt, he tested files from 1.2GB to 6.34GB to see if higher-precision quantizations improved spatial reasoning.The results challenge the assumption that larger quantizations of small models are inherently more capable. While Willison previously found that local Qwen3.6 models could outperform frontier models at SVG tasks, these tests produced poor outputs across the board. This suggests that architecture matters more than compression levels.
For developers building local apps, these findings suggest that downloading the largest quantization of a 3B model may not provide a quality boost for complex reasoning. The IBM Granite 4.1 family is available under the Apache 2.0 license. You can access the quantized variants via Unsloth on Hugging Face.
Still wondering? A few quick answers below.
Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →





