Next cohort starts 1st of next month · only 3 seats left
contact@DevOpsSchool.com · +91 99057 40781 ·
PG-CERT · DevOpsSchool Certification

Prometheus with Grafana Certification (PG-CERT)

Master Prometheus metrics, PromQL, Grafana dashboards, AlertManager, and the full LGTM observability stack. Every session is a live demo in a real lab environment — not slides, not theory. You watch the instructor build it, then you build it yourself.

 4.8 / 5 · 2,300+ ratings 18,000+ certified learners 721 enrolled in last 90 days
Duration
5 weeks
Total content
100+ hours
Per tool
5 hrs · 2 assignments · 1 capstone
Final exam
3 hrs · online · open-book
NEXT COHORT · 1st of next month
₹34,999 ₹49,999 SAVE 30%
Live & interactive cohort · GST extra as applicable · EMI available
--
Days
--
Hrs
--
Min
--
Sec
Only 3 of 10 seats left

What's included
  • 5-week program · 100+ hours of content
  • Live & interactive instructor sessions
  • 2 assignments & 1 capstone per tool
  • 3-hour online open-book final exam
  • Recordings, slides & lab repos
  • Industry-recognised digital certificate
  • Lifetime forum support — ask anything, forever
  • FREE 1-year LMS access — entire DevOpsSchool LMS: 20+ courses, 50+ tools, videos, quizzes, assignments & projects.
Cohort-cancellation refund. If we cancel or postpone the cohort (instructor unavailability, low enrolment, force majeure), you receive a 100% refund within 15 days. See refund policy.
Reserve my seat — ₹34,999
Engineers we've trained work at
JPMorgan Chase Bank of America Wells Fargo Verizon Nokia World Bank GE Healthcare VMware Oracle Qualcomm Mercedes-Benz Airbus Datadog Splunk Deloitte Infosys Wipro Capgemini
# career outcomes

Walk in an engineer. Walk out a PG-CERT who ships.

By the end of PG-CERT, you'll have shipped 12 production-grade artefacts and proven you can:

Design CI/CD for multi-service applications, with branching, gates, and progressive rollout.

Provision infrastructure as code across AWS, Azure, or GCP using Terraform — including drift control.

Automate configuration at scale with Ansible — idempotent playbooks, secret-free roles.

Run containers on Kubernetes — workloads, networking, autoscaling, observability.

Shift security left — SAST/DAST, SBOM, signed images, policy as code with OPA.

Operate SLOs — define error budgets, run incident response, write postmortems.

Median salary after certification
$118K – $165K
Roles our PG-CERT alumni land: DevOps Engineer · Platform Engineer · SRE · Build & Release Manager · Cloud Automation Lead. Based on alumni reporting, 2024–25.
Start now — ₹34,999
# why this program

It's training built by people who run production for a living.

Taught by senior practitioners

Every instructor has 15+ years operating production systems — our lead instructor, Rajesh Kumar, has 20.

Build your own lab — not a sandbox

We teach you to provision a production-grade environment on your own AWS/Azure/GCP. It's the same skill you'll use day one on the job — and it goes with you when you leave.

100% demo-driven

Every session is a live demonstration in a working lab — never slides, never theory. You watch the instructor build it in real time, then you build it yourself.

Job-ready portfolio

You leave with 12 GitHub-ready projects you can show in interviews tomorrow.

# next cohort

Live cohorts — pick the track that fits your week.

Every cohort is capped at 10 learners by design. That's how the instructor still answers your real production questions in week 4 — not just the rehearsed ones from week 1.

Weekend cohort Most popular

Starts 1st of next month · Sat · Sun · 10:00 AM – 1:00 PM IST
  • 5 weekends · ~8 hrs/weekend live + self-paced
  • Designed for working professionals on IST/EST/GMT
  • Mentor office hours · Sunday 11 AM IST
  • Only 3 of 10 seats left
Reserve seat — ₹34,999

Weekday cohort

Starts 1st of next month · Mon · Wed · Fri · 8:00 – 10:00 PM IST
  • 5 weeks · ~12 hrs/week (live + self-paced)
  • Recorded same-day · always-available replay
  • Mentor office hours · Thursday 7 PM IST
  • Capped at 10 learners — small-batch by design
Reserve seat — ₹34,999

Need a custom corporate cohort for your team? Talk to us →

# curriculum · PG-CERT

Tool-by-tool. Live demos, not slides.

Each tool is taught as a working live demonstration inside a real lab environment — you see it built end-to-end before you build it yourself. The structure is identical for every tool, so you always know what's coming and what you'll have shipped by the end of the week.

5 hours
content per tool
(live + self-paced video)
2 assignments
per tool
graded with feedback
1 capstone
per tool
GitHub-public portfolio
3-hr exam
online · open-book
at the end of the program
01 Observability Foundations — Metrics, Logs & Traces Live & Interactive5 hrs · 2 assignments · 1 capstone
The three pillars of observability, monitoring vs observability, the RED method (Rate, Errors, Duration), USE method (Utilisation, Saturation, Errors), and choosing the right signals.
  • Assignments: (1) apply RED method to instrument a sample API; (2) identify USE signals for an infrastructure component
  • Capstone: define an observability strategy for a three-tier application with RED/USE metrics and SLO targets
02 Prometheus Architecture & Installation Live & Interactive5 hrs · 2 assignments · 1 capstone
Prometheus pull model, TSDB internals (chunks, WAL, compaction), retention policy, remote write/read, scrape configuration, and service discovery (static, Kubernetes, EC2, Consul).
Capstone: deploy Prometheus with Kubernetes service discovery scraping all pods with prometheus.io annotations.
03 Exporters & Custom Instrumentation Live & Interactive5 hrs · 2 assignments · 1 capstone
Node Exporter, cAdvisor, Blackbox Exporter, JMX Exporter, MySQL Exporter, custom exporters with the Go/Python client libraries, and Pushgateway for batch jobs.
Capstone: write a custom Go exporter exposing five business metrics for a sample application.
04 PromQL — Querying Prometheus Live & Interactive5 hrs · 2 assignments · 1 capstone
Metric types (counter, gauge, histogram, summary), instant vs range vectors, label selectors, aggregation operators, rate() vs irate(), histogram_quantile(), and PromQL anti-patterns.
Capstone: write twenty PromQL queries covering RED signals, resource utilisation, and SLO compliance for a production app.
05 Recording Rules & Query Optimisation Live & Interactive5 hrs · 2 assignments · 1 capstone
Recording rule syntax, naming conventions, pre-computing expensive aggregations, rule groups, and reducing query latency on high-cardinality data.
Capstone: optimise ten slow dashboard queries by adding recording rules — measure before/after query times.
06 AlertManager — Routing, Inhibition & Silencing Live & Interactive5 hrs · 2 assignments · 1 capstone
AlertManager configuration (route, receiver, inhibit_rules, time_intervals), multi-window SLO alerting, Slack/PagerDuty/OpsGenie receivers, silences, and reducing alert fatigue.
Capstone: design an AlertManager routing tree covering five severity levels with deduplication and throttled escalation.
07 Grafana Fundamentals & Data Sources Live & Interactive5 hrs · 2 assignments · 1 capstone
Grafana architecture, data sources (Prometheus, Loki, Tempo, CloudWatch, Azure Monitor), panels (timeseries, stat, gauge, table, heatmap), variables, and dashboard JSON model.
Capstone: build a multi-data-source Grafana home dashboard ingesting Prometheus, Loki, and CloudWatch.
08 Grafana Dashboard Design & Golden Signals Live & Interactive5 hrs · 2 assignments · 1 capstone
USE/RED dashboard patterns, golden signals dashboards, template variables, dashboard annotations, links, repeating rows, and Grafonnet (dashboards as code).
Capstone: build a complete USE method infrastructure dashboard for a Kubernetes cluster with drill-down links.
09 SLOs in Grafana — Error Budget Dashboards Live & Interactive5 hrs · 2 assignments · 1 capstone
SLO panel plugin, multi-window multi-burn-rate alerts, error budget burn dashboards, SLO as code with Sloth/pyrra, and executive SLO reporting dashboards.
Capstone: implement SLO dashboards for three services with real-time error budget burn and 30-day reporting views.
10 Loki — Log Aggregation & LogQL Live & Interactive5 hrs · 2 assignments · 1 capstone
Loki architecture (distributor, ingester, querier), LogQL (log pipeline, metric queries), Fluent Bit log shipping, label strategy, and correlating logs with metrics in Grafana Explore.
Capstone: deploy Loki with Fluent Bit on Kubernetes, build LogQL alert rules, and correlate logs with Prometheus alerts.
11 Tempo & Distributed Tracing Live & Interactive5 hrs · 2 assignments · 1 capstone
Grafana Tempo, OpenTelemetry trace collection, TraceQL, exemplars linking metrics to traces, and the full Grafana LGTM (Loki + Grafana + Tempo + Mimir) stack.
Capstone: instrument a microservices app with OpenTelemetry, store traces in Tempo, and trace a P99 latency spike to root cause.
12 Production Prometheus — HA, Thanos & Mimir Live & Interactive5 hrs · 2 assignments · 1 capstone
Prometheus HA pairs, Thanos sidecar + query + store for long-term storage, Grafana Mimir for horizontally scalable Prometheus, remote write tuning, and multi-cluster federation.
Capstone: deploy a highly available Prometheus + Thanos stack with S3-backed long-term storage and multi-cluster querying.
Final certification exam2 hrs · online · practical
Practical exam: write PromQL queries, build Grafana dashboards, configure AlertManager routing, and design SLO dashboards against a live Prometheus + Grafana environment. Scenario-based assessment.
Want the full module breakdown?

Get the PDF syllabus with every tool, sub-topic, assignment brief, capstone spec and reading list.

Download syllabus
# your capstone portfolio

One capstone per tool. 12 GitHub-public artefacts you'll show in interviews.

Every tool you learn ends in a graded capstone project. By the end of the program you'll have a full portfolio of production-grade, employer-show-able work — sample capstones below.

CAPSTONE · PROMQL
20 production PromQL queries

RED signals, USE metrics, histogram quantiles, and SLO compliance queries for a production Kubernetes workload.

PrometheusPromQLKubernetes
CAPSTONE · ALERTMANAGER
Five-severity routing tree

P1–P5 severity routing with deduplication, throttling, PagerDuty escalation, and Slack notifications.

AlertManagerPagerDutySlack
CAPSTONE · GRAFANA
USE method Kubernetes dashboard

Complete infrastructure dashboard with CPU, memory, network, and storage USE signals plus drill-down links.

GrafanaPrometheusKubernetes
CAPSTONE · SLOs
Error budget burn dashboards

SLO dashboards for three services with real-time burn rate, 30-day error budget view, and multi-burn-rate alerts.

GrafanaSlothPrometheus
CAPSTONE · LOKI
Loki + Fluent Bit log pipeline

Fluent Bit shipping Kubernetes logs to Loki with LogQL alert rules and metric-correlated log exploration.

LokiFluent BitGrafana
CAPSTONE · HA
Prometheus + Thanos HA stack

HA Prometheus pair with Thanos sidecar, S3 long-term storage, and multi-cluster querying from a single Thanos Query.

ThanosPrometheusS3
# tools you'll master

30+ industry-standard tools, in the order a real engineer adopts them.

Prometheus
Grafana
AlertManager
Loki
Grafana Tempo
Thanos
Grafana Mimir
OpenTelemetry
Fluent Bit
Node Exporter
cAdvisor
Blackbox Exporter
PagerDuty
Slack
Kubernetes
PromQL
LogQL
TraceQL
Sloth / pyrra
Go / Python SDK
# the final exam

3 hours. Online. Open-book. Built to test what you can ship.

The PG-CERT examination is intentionally not a memorisation contest. Open-book, scenario-driven, and proctored online — it tests whether you can solve real production problems with the toolchain you spent five weeks practising.

3 hours
total duration
Online
from anywhere
Open-book
notes, docs, the LMS
Scenario-based
real engineering tasks

What it covers
  • Multi-part production scenarios that span the toolchain end-to-end
  • Pipeline design, IaC, configuration, containers, K8s, observability, security
  • Debugging exercises — given symptoms and logs, find the root cause
  • Written reasoning on trade-offs (e.g. blue/green vs canary, push vs pull GitOps)
Why open-book

In a real on-call shift you look things up. The exam mirrors that. We test the skill that actually matters — composing what you know into a working solution under time pressure. Memorising flag syntax wouldn't make you a better engineer.

Pass → certified.

Clear the exam and you'll be issued the PG-CERT digital certificate within 5 working days, with a verifiable credential ID on our public registry.

  • Two free re-attempt windows if you don't clear first time
  • Detailed feedback report on every section
  • Mock papers + walkthrough during the program
  • Hard copy of the certificate on request
See the credential
# meet your instructor

You're not learning from a content team. You're learning from the person who built it.

RK

Rajesh Kumar

Principal DevOps Engineer and Architect
20 years · DevOps · SRE · Security Early-bird practitioner · MLOps · AIOps Ex-PayPay · SoftwareAG · ServiceNow · Adobe · Intuit · IBM · Accenture 10,000+ engineers trained M.Tech · BITS Pilani 25+ certifications

Rajesh is a working practitioner with 20 years across DevOps, SRE and Security, and an early-bird operator in MLOps and AIOps — he was already running model-deployment and telemetry-driven incident pipelines years before either term became industry vocabulary. He has held principal engineering and architect roles at PayPay, SoftwareAG, ServiceNow (Netherlands), JDA Software, Intuit, Adobe, IBM/Emptoris, Ness, MindTree and Accenture. He has personally trained engineers at JPMorgan Chase, Wells Fargo, Bank of America, Verizon, Nokia, World Bank, GE Healthcare, VMware, Citrix, Oracle, Qualcomm, Ericsson, Splunk, New Relic, Datadog, Airbus, AstraZeneca, Bosch, Mercedes-Benz, Vodafone, Deloitte, EY, Capgemini, Infosys, Cognizant, HCL, Wipro and dozens more. He teaches what he runs — not what he reads.

# your credential

A certificate engineers actually recognise — and recruiters look for.

Every PG-CERT certificate is issued with a unique credential ID, a tamper-proof QR code, and a verification URL on devopsschool.com/certificates. Add it to LinkedIn in one click.

  •   Lifetime verifiable on our public registry
  •   PDF + digital badge (Credly-compatible)
  •   Recognised by hiring partners across 50+ countries
  •   Hard copy shipped on request — order here
Get certified — ₹34,999
Certificate of completion
Jane Engineer
has successfully completed
Prometheus with Grafana Certification (PG-CERT)
Credential ID · DS-PG-CERT-XXXX-XXXX
# what learners say

4.8 / 5 from 2,300+ engineers. Here's what a few of them said.

# pricing

Pick the level of support that fits your goal.

Every plan includes the full curriculum, recorded sessions, and access to our learner community.

Every plan includes 1 year of full DevOpsSchool LMS access.
Not just this one course — the entire LMS: 20+ courses, 50+ tools, videos, quizzes, assignments, and end-to-end projects. Worth ₹40,000+ on its own.
See what's in the LMS
Self-paced video ₹833 / month · billed yearly (₹9,996) All recorded sessions, labs & the full LMS — learn at your own pace.
  • Full 100+ hour recorded curriculum
  • 12 hands-on capstones on your own cloud lab (free-tier setup walkthrough included)
  • 1-year access — recordings, labs & updates
  • 3-hr online open-book exam
  • Industry-recognised certificate on completion
  • Lifetime forum support
  • Full LMS access — 20+ courses & 50+ tools
  • Live instructor classes
  • 1-on-1 mentor sessions
Get self-paced — ₹833/mo
1-on-1 Mentorship ₹99,999 full program Dedicated senior practitioner. Pace, schedule and labs tailored to you.
  • Everything in Live & Interactive
  • Private 1-on-1 instructor (your schedule)
  • Custom curriculum & labs for your stack
  • Resume & LinkedIn review
  • Mock interview & salary negotiation prep
  • Capstone & portfolio code review
  • Priority response from instructor
  • Lifetime forum support
  • Full LMS access — 20+ courses & 50+ tools
Enrol 1-on-1 — ₹99,999
Cohort-cancellation refund
If we cancel or postpone a cohort and you decline the rescheduled session, you get 100% refund within 15 days. Refund policy →
Terms & course material
All training material is the IP of DevOpsSchool and for the enrolled learner's personal use only. Terms →
Your data stays with us
We never share your data with third parties. Unsubscribe from communications anytime. Privacy →

Need an invoice for your employer? Request a corporate quote →  ·  Taxes (GST) where applicable are billed in addition to the listed price.

# why us

Why engineers pick DevOpsSchool over the alternatives.

Not slides. Not a 500-seat MOOC. Not a temporary sandbox login. Three things make the difference — then compare us line-by-line.

100% live demo. 0% slides.

Every session is the instructor screen-sharing a real working lab and building the thing in front of you — then you build it yourself. No PowerPoint, no "imagine if…".

You build your own lab.

We guide you through provisioning a free-tier AWS / Azure / GCP environment on day one — the same skill you'll use at work. A temporary sandbox login disappears the day the cohort ends. Your own lab doesn't.

10 learners. By design.

Cohorts are capped at 10 by design. The instructor still knows your name in week 4 — and still has time to debug the weird production thing you brought from work.

What matters YouTube + blogs Generic online course Boot camp DevOpsSchool PG-CERT
Teaching method You piece it together yourself Pre-recorded talking-head + slides Mix of slides & some labs Live demos in a real lab — every session
Cohort size 1 (you, alone) Hundreds to thousands 30–60 per batch 10 by design — instructor knows your name
Lab environment None Throwaway sandbox Shared sandbox login Your own AWS/Azure/GCP, guided setup
Per-tool structure Ad-hoc Inconsistent across modules Theme-based, varies wildly 5 hrs · 2 assignments · 1 capstone for every tool
Final assessment None Multiple-choice quiz Mini-project 3-hour open-book scenario exam
Portfolio at the end What you built solo 1–2 generic toy projects 1 capstone 1 capstone per tool — GitHub-public
Instructor pedigree Mixed (creator-economy) Mixed (often academic) Recent-grad TAs common Rajesh Kumar — 20 yrs, ex-PayPay/ServiceNow/Adobe
Cohort start cadence N/A — pure self-pace Self-paced only Quarterly windows New cohort every 1st of the month
Post-program support None Drip-fed retention emails 30–90 day Slack Lifetime forum + alumni community
LMS bundled No This one course only This program only 1 year full LMS — 20+ courses, 50+ tools
Refund posture N/A Vendor-specific, often none after start Usually none after week 1 100% within 15 days if we cancel
Total cost (full program) Free, slow ₹15K – ₹50K per single course ₹80K – ₹3L+ ₹34,999 · LMS + lifetime forum included

Still on the fence? Talk to an advisor →   — they'll tell you straight if PG-CERT fits your goal.

# frequently asked

Everything you'd ask on a 1-on-1 call.

Don't see your question? Ask us directly →

Do I need prior DevOps or coding experience?
A working knowledge of Linux command line and basic Git is enough. We'll bring you up to speed on everything else from Module 1. About 30% of every cohort enters from a sysadmin / dev / QA background.
What if I miss a live class?
Every session is recorded and shared with the cohort within 24 hours. You retain access to the recordings and lab repositories for the duration of the cohort and a defined access window after it. Specific access duration is confirmed at enrolment.
How does the certificate work? Is it accredited?
We issue a DevOpsSchool-credentialed digital certificate plus a verifiable badge. Each certificate has a unique credential ID and a public verification URL. While it isn't a vendor exam like AWS or CNCF, every cohort includes coaching toward those external exams as a track-add.
Can I pay in instalments / EMI?
Yes — 3, 6, and 12-month plans are available via our payment partners with 0% interest on the 3-month option. We also support employer invoicing.
What's the refund policy?
Once a training cohort is confirmed, the seat is generally non-refundable. The exception is when we cancel or postpone — instructor unavailability, low enrolment, or force majeure — in which case you receive a 100% refund within 15 working days, or you can join the rescheduled cohort. GST and payment-gateway fees are not refunded. Full details on the refund policy page.
Do you give us a cloud sandbox, or do we set one up?
We do it the way you'll do it on the job — you provision your own AWS / Azure / GCP lab, and we walk you through the free-tier setup step-by-step before module 1 starts. Most labs run at zero out-of-pocket. The point is that the skill of owning your infrastructure goes with you forever; a sandbox login disappears the day the cohort ends.
Do you offer corporate or team enrolments?
Yes — private cohorts for teams of 8+ are our most-requested format. We can run them on your schedule, in your VPC, against your internal toolchain. Request a quote.
What time-zones do the live cohorts run in?
Default schedule is IST-friendly, but the weekend cohort works for EST/CET/GMT engineers as well. Recordings cover the rest. We also run a North America-specific cohort every quarter — ask us for the calendar.
Still on the fence?

Talk to an advisor — they'll tell you straight whether this fits your goal.

Talk to advisor
# ready when you are

Reserve your PG-CERT seat — or talk to an advisor first.

Next cohort starts 1st of next month. Only 3 of 10 seats remaining. Drop your details and we'll send the full syllabus + book a free 20-min consult to map this cert to your goal.

  • No spam, no auto-dial bots
  • Syllabus PDF in your inbox in 60 seconds
  • One human reply within 4 working hours
By submitting you agree to be contacted by email, phone, or WhatsApp by DevOpsSchool about this program. We don't share your data with third parties and you can unsubscribe anytime. See privacy · terms · refund.
Talk to advisor Enrol — ₹34,999