Google Gemma
@googlegemma
https://t.co/rTAFbP4z2I
This flexibility addresses a common bottleneck in multimodal performance, where a model has to understand both text and images. With a configurable visual token budget, the model can scale the number of tokens it spends on an image from 70 to 1120. This mirrors Gemma 4's frontier reasoning, letting you prioritize speed or accuracy for complex visual data.
You can now manually set the resolution for specific images to match task requirements. A 70-token budget is ideal for fast classification, while the 1120-token maximum provides the fine-grained detail needed for document analysis. These controls are available alongside Gemma 4 training on Fireworks AI.
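As a rough illustration of that trade-off, the sketch below picks a visual token budget per task and clamps it to the 70 to 1120 range described above. The names `pick_visual_token_budget` and `TASK_BUDGETS` and the task labels are hypothetical, invented for this example; they are not part of any Gemma or Fireworks AI API, and the way the budget is actually passed to the model is not shown here.

```python
# Bounds taken from the article: the visual token budget ranges from 70 to 1120.
MIN_VISUAL_TOKENS = 70
MAX_VISUAL_TOKENS = 1120

# Hypothetical task-to-budget mapping (an assumption for illustration only).
TASK_BUDGETS = {
    "classification": MIN_VISUAL_TOKENS,     # fast, coarse view of the image
    "document_analysis": MAX_VISUAL_TOKENS,  # fine-grained detail
}


def pick_visual_token_budget(task: str, default: int = MIN_VISUAL_TOKENS) -> int:
    """Return a token budget for the task, clamped to the supported range."""
    budget = TASK_BUDGETS.get(task, default)
    return max(MIN_VISUAL_TOKENS, min(MAX_VISUAL_TOKENS, budget))


if __name__ == "__main__":
    for task in ("classification", "document_analysis"):
        print(task, pick_visual_token_budget(task))
```

Unknown tasks fall back to the supplied default, which is clamped as well, so a caller cannot request a budget outside the supported range by accident.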