AI use cases for infra teams: AIOps and beyond
Thirteenth post in the series. In the previous one, we diagnosed the incidents that wake you up at 2 AM. Now something different: how to use AI to improve the infrastructure work itself. Flipping the perspective Over the past 12 posts, you’ve been building infra for AI: GPUs, clusters, pipelines, security, monitoring, cost management. You’ve become an expert at providing compute for data scientists. But what about using AI for your work? Log analysis, anomaly detection, capacity planning, IaC generation, automated incident response. AIOps isn’t a new buzzword; it’s the practical application of what you already understand (models, inference, tokens) to your day-to-day operations. ...