I released LLM 0.32a0 this morning, a major backwards-compatible refactor of my LLM Python library and CLI tool for working with language models - the new changes should help LLM work better with reasoning models and other new frontier capabilities https://t.co/iLhtLrCQCL
Simon Willison Refactors LLM Library to Support Frontier Reasoning and Tools
Simon WillisonSimon Willison released an alpha version of his LLM Python library and CLI tool that moves beyond simple text prompts to support complex message sequences. The update introduces a streaming architecture for handling reasoning tokens and tool calls to maintain compatibility with frontier model capabilities.
0.32a0 of his LLM Python library and CLI tool. The update replaces text-only prompts with a message-based system and a streaming architecture designed to handle reasoning tokens (internal model thinking and logic steps) and tool calls.As frontier models shift toward agentic workflows, they no longer return simple strings. Models like Claude now produce internal reasoning and structured tool requests alongside standard text. This refactor ensures developers can capture these distinct typed parts without the library's core logic breaking or conflating different output streams.
You can now use the messages=[] array to pass conversation histories to models like GPT-5.5. The CLI highlights reasoning tokens in a different color and provides a --no-reasoning flag to suppress them. The alpha is available now via the LLM GitHub repository for testing.
Still wondering? A few quick answers below.
Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →




