We’re thrilled to announce the general availability (GA) of Managed Prometheus visualizations in Azure Monitor for AKS, along with an enhanced, unified AKS Monitoring experience.
Troubleshooting Kubernetes clusters is often time-consuming and complex whether you're diagnosing failures, scaling issues, or performance bottlenecks. This redesign of the existing Insights experience brings all your key monitoring data into a single, streamlined view reducing the time and effort it takes to diagnose, triage, and resolve problems so you can keep your applications running smoothly with less manual work.
By using Managed Prometheus, customers can also realize up to 80% savings on metrics costs and benefit from up to 90% faster blade load performance delivering both a powerful and cost-efficient way to monitor and optimize your AKS environment.
What’s New in GA
Since the preview release, we’ve added several capabilities:
- Control plane metrics: Gain visibility into critical components like the API server and ETCD database, essential for diagnosing cluster-level performance bottlenecks.
- Load balancer chart deep links: Jump directly into the networking drilldown view to troubleshoot failed connections and SNAT port issues more efficiently.
- Improved at-scale cluster view: Get a faster, more comprehensive overview across all your AKS clusters, making multi-cluster monitoring easier.
Simplified Troubleshooting, End to End
The enhanced AKS Monitoring experience provides both a basic (free) tier and an upgraded experience with Prometheus metrics and logging — all within a unified, single-pane-of-glass dashboard.
Here’s how it helps you troubleshoot faster:
- Identify failing components immediately
With new KPI Cards for Pod and Node Status, you can quickly spot pending or failed pods, high CPU/memory usage, or saturation issues, decreasing diagnosis time. - Monitor and manage cluster scaling smoothly
The Events Summary Card surfaces Kubernetes warnings and pending pod states, helping you respond to scale-related disruptions before they impact production. - Pinpoint root causes of latency and connectivity problems
Detailed node saturation metrics, plus control plane and load balancer insights, make it easier to isolate where slowdowns or failures are occurring — whether at the node, cluster, or network layer.
Free vs. Upgraded Metrics Overview
Here’s a quick comparison of what’s included by default versus what you get with the enhanced experience:
Basic tier metrics | Additional metrics in upgraded experience |
Alert summary card | Historical Kubernetes events (30 days) |
Events summary card | Warning events by reason |
Pod status KPI card | Namespace CPU and memory % |
Node status KPI card | Container logs by volume |
Node CPU and memory % | Top five controllers by logs volume |
VMSS OS disk bandwidth consumed % (max) | Packets dropped I/O |
VMSS OS disk IOPS consumed % (max) | |
Load balancer SNAT port usage | |
API server CPU % (max) (preview) | |
API server memory % (max) (preview) | |
ETCD database usage % (max) (preview) |
See What Customers Are Saying
Early adopters have already seen meaningful improvements:
"Azure Monitor managed Prometheus visualizations for Container Insights has been a game-changer for our team. Offloading the burden of self-hosting and maintaining our own Prometheus infrastructure has significantly reduced our operational overhead. With the managed add-on, we get the powerful insights and metrics we need without worrying about scalability, upgrades, or reliability. It seamlessly integrates into our existing Azure environment, giving us out-of-the-box visibility into our container workloads. This solution allows our engineers to focus more on building and delivering features, rather than managing monitoring infrastructure." – S500 customer in health care industry
Get Started Today
We’re committed to helping you optimize and manage your AKS clusters with confidence. Visit the Azure portal and explore the new AKS Monitoring experience today!
Learn more: https://aka.ms/azmon-prometheus-visualizations
Updated May 14, 2025
Version 1.0viviandiec
Microsoft
Joined May 22, 2023
Azure Observability Blog
Follow this blog board to get notified when there's new activity