The case for Blackwell in production agent infrastructure just got cleaner. @ArtificialAnlys AgentPerf gives the hardware picture. Together's coding agent benchmarks give the inference picture: 31% more TPS than the next-fastest OSS engine on the same hardware, through custom kernels built for Blackwell's Tensor Core instructions. Cursor runs their real-time coding agents on this stack. Learn more about how we built it in the 🧵
Together AI Delivers 31% Faster Coding Agent Inference on Blackwell
Together AITogether AI published coding agent benchmarks showing its inference engine achieves 31% more tokens per second than the next-fastest open-source engine on NVIDIA Blackwell hardware. These performance gains result from custom kernels targeting Blackwell Tensor Core instructions. Cursor now runs its real-time coding agents on this production stack to maintain low-latency feedback loops during development.
Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →



