Cloudflare’s Gen 13 servers double our compute throughput by rethinking the balance between cache and cores. Moving to high-core-count AMD EPYC ™ Turin CPUs, we traded large L3 cache for raw compute density. By running our new Rust-based FL2 stack, we completely mitigated the latency penalty to unlock twice the performance. https://t.co/1KNDtooLXm
Cloudflare Gen 13 Servers Double Compute Throughput with AMD EPYC Turin
· Updated
Cloudflare deployed Gen 13 servers built around the AMD EPYC Turin 9965 — a 192-core, 384-thread processor with 2 MB of L3 per core, replacing Gen 12's AMD Genoa-X (96 cores, 1152 MB total L3). Gen 13 also doubles memory to 768 GB DDR5-6400, adds 24 TB PCIe 5.0 NVMe storage, and quadruples network bandwidth to dual 100 GbE.
The cache reduction would have been a problem on FL1 — but Cloudflare's FL2, a Rust rewrite of its request handling layer, does not depend on large shared caches and scales nearly linearly with core count. Running FL2 on Turin 9965 in production, Cloudflare achieved up to 2x throughput versus Gen 12 within latency SLAs, along with 50% better performance per watt.
Teams on Cloudflare Workers and its global edge network will see the benefit as Gen 13 deploys worldwide — twice the compute capacity at current latency budgets.
Cloudflare
@Cloudflare
19retweets
View on X




