The first iteration of gemma-skills is officially out! 🛠️ It enables agents to build with Gemma, including using MTP to improve speed, choosing the right model size for your use case, and locating up-to-date resources. https://t.co/VZHhvwH0Zw
Google Gemma releases gemma-skills to accelerate agentic workflows with multi-token prediction
The initial release ships the gemma-dev skill, which gives agents technical resources and support for Multi-Token Prediction (MTP), a technique that predicts multiple tokens at once to increase speed.
- Initial skill: gemma-dev
- Optimization support: Multi-Token Prediction (MTP)
- License: Apache-2.0
- Installation tools: Vercel skills CLI, Context7 skills CLI
- Programming languages: Python, JavaScript
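To make the MTP speed claim concrete, here is a minimal back-of-the-envelope sketch in Python. It is not part of gemma-skills or any Gemma API; the function name `decode_steps` and the numbers are hypothetical, and it assumes the idealized case where every predicted token is kept. Real speedups are typically smaller, since some predicted tokens may be discarded and each step can cost more.

```python
import math


def decode_steps(num_tokens: int, tokens_per_step: int = 1) -> int:
    """Count the decoding steps needed to emit `num_tokens`.

    Idealized assumption (hypothetical, for illustration only): every step
    yields exactly `tokens_per_step` usable tokens.
    """
    if num_tokens < 0 or tokens_per_step < 1:
        raise ValueError("num_tokens must be >= 0 and tokens_per_step >= 1")
    return math.ceil(num_tokens / tokens_per_step)


if __name__ == "__main__":
    baseline = decode_steps(256, tokens_per_step=1)    # one token per step
    multi_token = decode_steps(256, tokens_per_step=4)  # four tokens per step
    print(f"baseline steps: {baseline}")
    print(f"multi-token steps: {multi_token}")
    print(f"idealized step reduction: {baseline / multi_token:.1f}x")
```

The point of the sketch is only the shape of the saving: emitting several tokens per step cuts the number of sequential steps, which is where the latency of autoregressive generation comes from.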
This release extends the Agent Skills library pattern to Google's open-weight models, addressing the knowledge-layer bottleneck in agentic engineering. By guiding model size selection, the library helps agents navigate the Gemma 4 lineup and pick the most cost-effective model for a task without manual developer intervention.
Integrate these skills into your workflow using the Vercel or Context7 skills command-line interfaces. The library is available on GitHub under the Apache-2.0 license, and you can add the gemma-dev skill to specific projects. Once installed, it lets agents locate up-to-date documentation on their own and use MTP to speed up multi-step tasks.
Every HeadsUpAI update is written based on its original source and reviewed before it's published.


