https://rmmartins.com/azure-openai-in-production-tokens-throughput-and-high-availability/