DevOps NetworkDevOpsNetwork
HubsModulesRoadmapWhat's NewDaily ChallengeNew
DevOps NetworkDevOpsNetwork

Menu

PlannerLeaderboardInterview PrepModulesProjectsCheatsheetsResourcesEventsTech BlogContact & Suggestions
PlannerTrack weekly learning goals & streak
LeaderboardGlobal ranking & learning scoreboard
Interview PrepSRE & DevOps interview preparation
ProjectsHands-on projects & system designs
CheatsheetsQuick-reference syntax guides
ResourcesCurated learning bookmarks & links
EventsLive sessions & webinars schedule
Tech BlogEngineering deep dives & tutorials
Report Bug & FeedbackReport bugs & share suggestions
DevOps Network

Helping you learn modern infrastructure step-by-step. Join our community of engineers today.

Platform Status: Online

Platform

  • Career Roadmaps
  • Learning Modules
  • Hands-on Projects
  • Developer Hubs

Resources

  • Cheatsheets
  • Interview Prep
  • Concept Guides
  • Technical Glossary
  • Curated Links

Community

  • Engineering Blog
  • Live Events
  • About Us
  • Contact Us
Join the Engineering Core

Join our Newsletter

Master DevOps with 1 high-density email per week.

© 2026 DevOps Network. All rights reserved.

Privacy PolicyTerms of ServiceCookie Policy
Built by Daksh Saini

Learning Modules.

Pick a module and start learning specific DevOps tools or concepts at your own pace.

(65)
65 Modules Available
BEGINNER

GitHub Actions - Complete DevOps CI/CD Guide

Learn GitHub Actions - build automated CI/CD pipelines, run tests, deploy to cloud, and trigger workflows on every code push.

GITHUB-ACTIONSCI-CDWORKFLOWSAUTOMATION
12Topics
38Resources
START
BEGINNER

Networking

Learn networking for DevOps engineers - TCP/IP, DNS, HTTP, subnets, firewalls, and how traffic flows in production systems.

TCP-IPDNSHTTPSUBNETS
9Topics
0Resources
START
BEGINNER

Docker

Learn Docker from the ground up - understand containers, images, Dockerfile, volumes, networking, and Docker Compose. This module takes you from zero to confidently building, running, and shipping containerized applications the way it's done in real DevOps and cloud environments.

DOCKERCONTAINERSDOCKERFILEDOCKER-COMPOSE
17Topics
23Resources
START
BEGINNER

Terraform - IAC

Learn how to stop clicking through cloud consoles and start managing servers, networks, and databases with code - the way every modern DevOps team does it.

STATE-MANAGEMENTINFRASTRUCTURE-AS-CODETERRAFORMANSIBLE
14Topics
30Resources
START
BEGINNER

Linux for Devops

Learn Linux for DevOps from scratch - file system, permissions, processes, bash scripting, and production server management.

linuxfundamentalsbash
23Topics
11Resources
START
BEGINNER

Shell Scripting

Learn shell scripting and Bash for DevOps - write production scripts, automate tasks, handle errors, and schedule with cron.

BASHSHELL-SCRIPTINGAUTOMATIONLINUX
17Topics
8Resources
START
BEGINNER

Version Control - Git and GitHub

Learn Git and GitHub for DevOps - commits, branching, merging, pull requests, rebasing, and team collaboration workflows.

GITVERSION-CONTROLGITHUBBRANCHING
16Topics
8Resources
START
INTERMEDIATE

Azure DevOps

From understanding what CI/CD is to building real pipelines on Azure DevOps - everything a DevOps engineer needs to know, in the right order.

AZURE-DEVOPSCI-CDPIPELINESBOARDS
12Topics
31Resources
START
BEGINNER

GitLab for devops

A complete guide to GitLab - from version control and merge requests to building production-grade CI/CD pipelines, container registry, security scanning, and deployment automation.

ci/cdgitgitlabCI-CD
29Topics
33Resources
START
BEGINNER

Kubernetes (K8s)

Learn Kubernetes from scratch - pods, deployments, services, ingress, Helm, and production-grade container orchestration.

KUBERNETESK8SKUBECTLPODS
26Topics
32Resources
START
BEGINNER

Monitoring and Logging

Learn DevOps monitoring and logging - Prometheus, Grafana, ELK stack, alerting, and building production observability pipelines.

PROMETHEUSGRAFANAELK-STACKOBSERVABILITY
28Topics
15Resources
START
BEGINNER

Build and Package Management

This module teaches you how software goes from raw source code to a deployable artifact.

MAVENGRADLENPMBUILD-TOOLS
11Topics
31Resources
START
INTERMEDIATE

DevSecOps Basics

Security is not something you add at the end. It runs through every step of building and deploying software. This module teaches you the mindset, the tools, and the pipeline practices that make security part of your daily DevOps workflow - not an afterthought.

DEVSECOPSSECURITYSASTDAST
10Topics
8Resources
START
BEGINNER

AWS - Amazon Web Services

A complete, practical reference for DevOps engineers. Covers all 20 core AWS services with architecture, flows, real-world usage, CLI commands, and comparisons.

amazon web serviceS3CLOUDIAM
26Topics
37Resources
START
BEGINNER

Apache Kafka

Master Apache Kafka - producers, consumers, topics, partitions, consumer groups, and building event-driven architectures at scale.

KAFKAEVENT-STREAMINGMESSAGE-QUEUEPRODUCER-CONSUMER
16Topics
5Resources
START
INTERMEDIATE

Cross - Account Image Signing & Verification

AWS Signer · Notation · Ratify · Gatekeeper on EKS

AWS signerSignerCross accountratify
10Topics
0Resources
START
INTERMEDIATE

Cross Region Image Signing and Verification

AWS Signer · Notation · Ratify · Gatekeeper on EKS · Account A (us-east-1) signs - Account B (ap-southeast-1) enforces

aws signerawssigning profileOIDC
10Topics
39Resources
START
ADVANCED

Cross-Account Image Signing and Verification

AWS Signer · Notation · Ratify · Gatekeeper on EKS · Account A signs - Account B enforces · Same Region (us-east-1)

AWS signerSignerCross accountratify
8Topics
5Resources
START
ADVANCED

S3 to FSx for OpenZFS

Automate file transfer from Amazon S3 to FSx for OpenZFS so that every file uploaded to a specific S3 path automatically appears on FSx - no manual commands needed after setup.

AWS-S3FSXEVENTBRIDGEMIGRATION
11Topics
4Resources
START
INTERMEDIATE

Python for Beginners - Zero to Confident

Learn Python from absolute zero - variables, data types, loops, functions, files, APIs, and real projects. No prior experience needed.

pythonprogramming-basicsscriptingautomation
17Topics
4Resources
START
BEGINNER

Anomaly Detection with Isolation Forest

Learn anomaly detection with Isolation Forest - understand how the algorithm works, apply it to real ops metrics, and build a complete anomaly detector from scratch.

anomaly-detectionisolation-forestmachine-learningaiops
9Topics
15Resources
START
BEGINNER

Prompting Techniques for Ops Reasoning

Learn prompt engineering from scratch - master zero-shot, few-shot, chain-of-thought, and ops-specific patterns to get reliable results from any LLM.

PROMPT-ENGINEERINGLLMAIOPSCHAIN-OF-THOUGHT
11Topics
3Resources
START
BEGINNER

How AI Agents Work and Tool Calling

Learn how AI agents work from scratch - reasoning loops, tool calling, ReAct pattern, and build a real ops incident investigator agent in pure Python.

ai-agentstool-callingreact-patternllm-agents
11Topics
5Resources
START
INTERMEDIATE

RAG + Vector Databases for Ops Knowledge

Give your agent a memory built from your own runbooks and ops docs - so it stops guessing and starts answering from what your team actually knows.

RAGVECTOR-DATABASEEMBEDDINGSCHROMADB
16Topics
3Resources
START
INTERMEDIATE

Event Correlation & Alert Noise Reduction - prometheus and Alert manager

During a production incident, 200 alerts fire and 180 of them are noise from one root cause. This module teaches you how to group, deduplicate, inhibit, and correlate alerts so your team sees one clear signal instead of a storm

ALERTMANAGERALERT-CORRELATIONNOISE-REDUCTIONPROMETHEUS
10Topics
8Resources
START
INTERMEDIATE

RCA & Runbook Prompt Templates for AIOps

Seven ready-to-use prompt templates for RCA, runbook generation, incident updates, and postmortems — so your AI agent reasons like a senior engineer during real incidents.

PROMPT-ENGINEERINGRCARUNBOOKSAIOPS
6Topics
4Resources
START
INTERMEDIATE

AI Guardrails & Output Validation for AIOps

Build guardrails that prevent your AI agent from running destructive commands in production — with command risk classification, output validation, and human approval gates.

GUARDRAILSAI-SAFETYOUTPUT-VALIDATIONAIOPS
6Topics
11Resources
START
BEGINNER

Running & Choosing AI Models — Ollama, Local & Cloud

Learn when to use cloud models vs local open source models for ops workloads. Run Llama locally with Ollama for sensitive data and high-volume tasks — zero data leaves your network.

OLLAMALOCAL-MODELSLLMCLOUD-MODELS
6Topics
1Resources
START
BEGINNER

Distributed Systems Failure Modes & CAP Theorem

Learn the eight failure patterns that cause production incidents at scale — cascading failures, split-brain, thundering herds, and more — plus CAP theorem, circuit breakers, and the reasoning process that turns alert storms into structured diagnosis.

DISTRIBUTED-SYSTEMSCAP-THEOREMFAILURE-MODESRELIABILITY
6Topics
0Resources
START
INTERMEDIATE

RCA & Production Debugging Workflow

Learn a structured 5-step process to diagnose production incidents — reading logs, metrics, and traces together to find root cause fast instead of guessing.

RCAINCIDENT-RESPONSEDEBUGGINGPROMETHEUS
6Topics
7Resources
START
INTERMEDIATE

Building MCP Servers for Ops Systems

Learn to build MCP servers that give AI agents safe, structured access to your Kubernetes cluster, Prometheus metrics, and internal runbooks.

MCPAI-AGENTSTOOL-CALLINGKUBERNETES
11Topics
33Resources
START
INTERMEDIATE

OpenTelemetry & Distributed Tracing for AIOps

Learn to instrument Python services with OpenTelemetry, collect traces through the OTel Collector, and use distributed tracing to find the exact cause of production incidents.

OPENTELEMETRYDISTRIBUTED-TRACINGOBSERVABILITYPROMETHEUS
12Topics
7Resources
START
BEGINNER

Ansible Runbooks and Self-Healing Infrastructure

Learn to write Ansible playbooks, build automated ops runbooks, and connect Prometheus alerts to self-healing workflows that fix production issues without human intervention.

ANSIBLEAUTO-REMEDIATIONSELF-HEALINGAIOPS
15Topics
5Resources
START
BEGINNER

Time-Series Forecasting with Prophet and ARIMA

Learn to forecast ops metrics, detect anomalies, and predict capacity issues before they happen using Prophet and ARIMA in Python.

TIME-SERIESPROPHETARIMAANOMALY-DETECTION
9Topics
19Resources
START
BEGINNER

DevSecOps Foundations

Learn what DevSecOps is, how attackers think, how mature teams embed security into every stage of software delivery, and how to model threats before writing a single line of code.

DEVSECOPSSECURITYTHREAT-MODELINGSTRIDE
12Topics
9Resources
START
BEGINNER

Git and Version Control Security

Learn how to secure Git repositories from leaked secrets, unauthorized commits, and supply chain attacks - covering pre-commit hooks, Gitleaks, TruffleHog, branch protection, CODEOWNERS, signed commits, and Dependabot.

GITSECRET-SCANNINGGITLEAKSTRUFFLEHOG
10Topics
12Resources
START
BEGINNER

CI/CD and Secure Pipeline Design

Learn how to build and harden CI/CD pipelines - covering OIDC federation, Vault secrets injection, least privilege runners, GitHub Actions permissions, Jenkins hardening, artifact signing with Cosign, and audit logging.

CICDGITHUB-ACTIONSJENKINSOIDC
10Topics
14Resources
START
INTERMEDIATE

Shift Left Security - SAST, SCA, and Secrets Management

Learn how to embed security into the development phase — covering SonarQube SAST with quality gates, Snyk and OWASP Dependency Check for SCA, CVSS vulnerability prioritization, and HashiCorp Vault dynamic secrets management including Kubernetes sidecar injection.

SASTSCASONARQUBESNYK
9Topics
13Resources
START
BEGINNER

Dynamic Testing, SBOM, and Supply Chain Security

Learn DAST with OWASP ZAP, generate Software Bills of Materials with Syft, scan for vulnerabilities with Grype, sign artifacts with Cosign, and understand SLSA provenance - through the lens of real supply chain attacks like SolarWinds and XZ Utils.

DASTOWASP-ZAPSBOMSYFT
8Topics
12Resources
START
INTERMEDIATE

Container and Docker Security

Learn how to secure Docker containers from image hardening and vulnerability scanning to runtime protection - covering non-root users, minimal base images, multi-stage builds, Trivy scanning, seccomp/AppArmor profiles, capability dropping, read-only filesystems, and the Docker socket threat.

DOCKERCONTAINER-SECURITYTRIVYSECCOMP
11Topics
26Resources
START
INTERMEDIATE

Kubernetes Security

Learn how to secure Kubernetes clusters end to end - covering RBAC and least privilege, Pod Security Standards, Network Policies, Secrets encryption at rest, service account hardening, Falco runtime security, and etcd protection.

KUBERNETESRBACNETWORK-POLICIESPOD-SECURITY-STANDARDS
9Topics
36Resources
START
INTERMEDIATE

Cloud Security - AWS

Learn how to secure AWS environments end to end - covering IAM least privilege, S3 public access blocking, CloudTrail audit logging, GuardDuty threat detection, Security Hub compliance, KMS encryption, Secrets Manager, VPC security groups, SCPs with AWS Organizations, and AWS Config compliance monitoring.

AWSIAMGUARDDUTYSECURITY-HUB
14Topics
38Resources
START
INTERMEDIATE

IaC Security - Terraform and Checkov

Learn how to secure Infrastructure as Code with Terraform - covering sensitive variable handling, state file protection, provider credential security, Checkov scanning with CI/CD integration, OPA policy-as-code, and tfsec static analysis to catch misconfigurations before deployment.

TERRAFORMCHECKOVIaC-SECURITYTFSEC
9Topics
40Resources
START
ADVANCED

Identity and Access Security

Master zero trust identity architecture - covering IAM RBAC and JIT access, OIDC workload federation for Kubernetes on AWS and GCP, HashiCorp Vault PKI and dynamic secrets, SPIFFE/SPIRE workload identity, OAuth2 PKCE, Privileged Access Management, Zero Standing Privileges, Kubernetes service account hardening, and automated secrets rotation.

ZERO-TRUSTIAMOIDCVAULT
12Topics
12Resources
START
ADVANCED

Runtime Security and Detection Engineering

Build a production-grade runtime security and detection pipeline -covering eBPF fundamentals, Falco syscall-based threat detection with custom rules, Tetragon eBPF enforcement, MITRE ATT&CK mapping for Cloud and Containers, Sigma rules and SIEM conversion, Elasticsearch SIEM with alert correlation and noise reduction, auditd Linux syscall monitoring, OpenTelemetry for security observability, and automated incident response runbooks.

EBPFFALCOTETRAGONMITRE-ATTACK
13Topics
14Resources
START
BEGINNER

Compliance, Governance and Incident Response

Learn how SOC 2, ISO 27001, and PCI-DSS map to your pipelines, automate compliance evidence, and run incident response from detection to blameless postmortem.

COMPLIANCESOC2ISO27001INCIDENT-RESPONSE
8Topics
7Resources
START
ADVANCED

Advanced Security Topics

Master supply chain security, secrets detection, fuzzing, threat modeling, chaos engineering, eBPF policies, zero-day response, and CIS benchmarks.

DEVSECOPSSUPPLY-CHAINCILIUMCIS-BENCHMARKS
11Topics
8Resources
START
ADVANCED

DevSecOps Deep Dive - Advanced Security for DevOps Engineers

Go beyond CI/CD scanning - learn threat modeling, runtime security, supply chain integrity, cloud security, and compliance automation.

DEVSECOPSRUNTIME-SECURITYSUPPLY-CHAINCLOUD-SECURITY
9Topics
12Resources
START
BEGINNER

Application Fundamentals for Platform Engineers

Understand what developers build and deploy - REST APIs, HTTP, Nginx, databases, environment variables, health checks, and microservices - from an infrastructure perspective, not a developer one.

REST-APIHTTPNGINXDATABASES
14Topics
8Resources
START
BEGINNER

GitOps and Continuous Delivery with ArgoCD

Master GitOps principles and ArgoCD - from understanding why GitOps exists to operating multi-environment deployments, App of Apps, ApplicationSets, and progressive delivery with Argo Rollouts.

GITOPSARGOCDCONTINUOUS-DELIVERYARGO-ROLLOUTS
16Topics
43Resources
START
BEGINNER

FinOps and Cloud Cost Engineering

Master FinOps for Kubernetes — understand unit economics, implement cost visibility with Kubecost, cut EC2 costs 60-70% with Karpenter and Spot instances, and right-size workloads to eliminate waste.

FINOPSKUBECOSTKARPENTERSPOT-INSTANCES
14Topics
36Resources
START
BEGINNER

Platform Engineering Foundations

Learn what separates Platform Engineers from DevOps Engineers - multi-tenancy, admission controllers, policy as code with Kyverno, CRDs, the operator pattern, and Cluster API for declarative cluster lifecycle management.

PLATFORM-ENGINEERINGMULTI-TENANCYKYVERNOADMISSION-CONTROLLERS
11Topics
31Resources
START
ADVANCED

Internal Developer Platforms and Backstage

Build an Internal Developer Platform from first principles - understand why IDPs exist, what problems they solve, how to implement a software catalog and golden paths with Backstage, and how to measure platform success with DORA metrics.

IDPBACKSTAGEGOLDEN-PATHSPLATFORM-ENGINEERING
12Topics
7Resources
START
BEGINNER

Project - Build a Production Application Platform

Build a production-grade application platform from scratch - React frontend, Node.js API, PostgreSQL, Redis — containerised with Docker, deployed to Kubernetes with Ingress, NetworkPolicies, HPA, PDB, resource limits, health checks, Prometheus monitoring, and GitOps delivery via ArgoCD.

CAPSTONEKUBERNETESDOCKERNODEJS
13Topics
50Resources
START
BEGINNER

Project - Build Production AWS Infrastructure with Terraform

Provision a complete production-grade AWS infrastructure from scratch using Terraform - VPC, EKS cluster, IAM roles, IRSA, ECR, and remote state. The infrastructure that Capstone 1's application runs on in a real company.

CAPSTONETERRAFORMAWSEKS
13Topics
51Resources
START
BEGINNER

project - Build a GitOps Delivery Platform with ArgoCD

Build a complete GitOps delivery platform - App of Apps pattern, multi-environment promotion from staging to production, ApplicationSets for scale, and progressive delivery with Argo Rollouts canary deployments.

CAPSTONEGITOPSARGOCDAPP-OF-APPS
12Topics
18Resources
START
BEGINNER

Project - Build a Mini Internal Developer Platform

Build a working Internal Developer Platform using Backstage - software catalog, golden path template that creates a new service end-to-end, TechDocs, and Kubernetes integration.

CAPSTONEIDPBACKSTAGEGOLDEN-PATHS
11Topics
4Resources
START
BEGINNER

Project - Production Readiness Challenge

The final test. Take a running application, inject real production failures - pod crashes, OOMKills, bad deployments, cost spikes, network issues - and learn to diagnose and fix each one.

CAPSTONESREDEBUGGINGOBSERVABILITY
12Topics
4Resources
START
ADVANCED

project - The Platform Engineering Challenge

The mega-capstone. One developer action in Backstage triggers the entire platform - Terraform provisions infrastructure, ArgoCD deploys the application, Prometheus monitors it, Kyverno validates policies, Kubecost tracks spend. Everything from Capstones 1-5 working together as one complete Platform Engineering system.

CAPSTONEPLATFORM-ENGINEERINGBACKSTAGETERRAFORM
11Topics
36Resources
START
BEGINNER

Project - Build an Anomaly Detection Pipeline

Build a real AIOps anomaly detection pipeline - collect live Kubernetes metrics from Prometheus, run machine learning to detect anomalies automatically, and fire alerts when something unusual happens. No more waiting for users to report problems.

CAPSTONEAIOPSANOMALY-DETECTIONPROMETHEUS
14Topics
35Resources
START
BEGINNER

project - Build an AI-Powered Incident Response Agent

Build an AI agent that receives production alerts, queries Prometheus for context, retrieves runbooks via RAG, and suggests remediation steps automatically.

CAPSTONEAIOPSAI-AGENTRAG
16Topics
33Resources
START
BEGINNER

Project - Build a Self-Healing Infrastructure System

Wire anomaly detection, AI diagnosis, and Ansible remediation into one autonomous pipeline that detects, diagnoses, and fixes production incidents without human intervention.

CAPSTONEAIOPSSELF-HEALINGANSIBLE
16Topics
33Resources
START
BEGINNER

DevSecOps project - Build a Fully Gated Secure CI/CD Pipeline

Build a production-grade secure pipeline where every push triggers SAST, SCA, secrets detection, container scanning, SBOM generation, and image signing before any code reaches production.

CAPSTONEDEVSECOPSSASTSCA
14Topics
8Resources
START
BEGINNER

DevSecOps project 2 - Harden a Kubernetes Cluster End to End

Lock down a production Kubernetes cluster using RBAC, Pod Security Admission, NetworkPolicies, OPA Gatekeeper, Falco runtime detection, and automated compliance scanning.

CAPSTONEKUBERNETESDEVSECOPSRBAC
12Topics
33Resources
START
BEGINNER

DevSecOps project 3 — The Platform Security Challenge

The mega-capstone. Wire Terraform IaC scanning, secrets management with Vault, zero-trust mTLS, security chaos engineering, and a unified compliance dashboard into one production security platform.

CAPSTONEDEVSECOPSTERRAFORMVAULT
10Topics
41Resources
START