Ptu on Ricardo Martins — Cloud Architecture, Azure, Kubernetes & AI

https://rmmartins.com/tags/ptu/
Last updated: Wed, 06 May 2026 21:50:50 -0400

Azure OpenAI in production: tokens, throughput, and high availability
https://rmmartins.com/2026/06/19/azure-openai-in-production-tokens-throughput-and-high-availability/
Fri, 19 Jun 2026 10:00:00 -0400

HTTP 429 isn't a bug, it's bad capacity planning. Deployment types, PTU vs Standard, multi-region, retry patterns, and how not to take down your chatbot on launch day.