What is DaemonSet? | DevOps Dictionary

DaemonSet — One Agent Per Node, Always

What is a DaemonSet in Simple Terms?

A DaemonSet is a standing order to the scheduler: "This pod must always run on every node — one copy per node, no more, no less." Not one total replica. One per node. When a new node joins the cluster, the DaemonSet pod is automatically placed on it. When a node is removed, the pod is cleaned up automatically.

DaemonSet vs Deployment — The Key Difference

◈ DIAGRAM

+------------------------------------------+     +------------------------------------------+
|             Deployment                   |     |              DaemonSet                   |
|                                          |     |                                          |
|  You control replica count               |     |  Cluster controls replica count          |
|  replicas: 3 (on any 3 nodes)            |     |  1 pod per node (on ALL nodes)           |
|                                          |     |                                          |
|  Use for: application workloads          |     |  Use for: infrastructure agents          |
+------------------------------------------+     +------------------------------------------+

One Pod Per Node — How It Looks

◈ DIAGRAM

+---------------+  +---------------+  +---------------+  +---------------+
|  mumbai-node-1|  |  mumbai-node-2|  |  mumbai-node-3|  |  mumbai-node-4|
|               |  |               |  |               |  |               |
|  [fluentd-0]  |  |  [fluentd-1]  |  |  [fluentd-2]  |  |  [fluentd-3]  |
+---------------+  +---------------+  +---------------+  +---------------+
        ^                                                          ^
        |                                                          |
        New node added to cluster -----> DaemonSet auto-schedules pod here

When to Use a DaemonSet

Use DaemonSet for infrastructure agents:

Log collection — Fluentd, Filebeat, Promtail (must read log files from every node's disk)
Metrics scraping — Node Exporter (must collect CPU/memory/disk from every node)
Network plugins — Calico, Cilium CNI agents (must configure networking on every node)
Security agents — Falco, CrowdStrike (must inspect every node's syscalls and processes)
Storage drivers — CSI node drivers that attach and mount volumes

Do NOT use DaemonSet for:

Application workloads (use Deployment)
Batch processing (use Jobs or CronJobs)
Anything where you want explicit control over replica count

A Real DaemonSet — Node Exporter for Prometheus

YAML

1# node-exporter-daemonset.yaml
2# Runs Prometheus Node Exporter on every node in mumbai-prod-cluster
3apiVersion: apps/v1
4kind: DaemonSet
5metadata:
6  name: node-exporter
7  namespace: monitoring
8spec:
9  selector:
10    matchLabels:
11      app: node-exporter
12  template:
13    metadata:
14      labels:
15        app: node-exporter
16    spec:
17      hostNetwork: true              # Uses the node's network namespace directly
18      hostPID: true                  # Sees all processes on the node (required for metrics)
19      tolerations:
20        - operator: Exists           # Tolerate ALL taints — run on control-plane nodes too
21          effect: NoSchedule
22      containers:
23        - name: node-exporter
24          image: prom/node-exporter:v1.7.0
25          ports:
26            - containerPort: 9100
27              hostPort: 9100         # Binds directly to the node's port 9100
28          args:
29            - '--path.procfs=/host/proc'
30            - '--path.sysfs=/host/sys'
31          volumeMounts:
32            - name: proc
33              mountPath: /host/proc
34              readOnly: true
35            - name: sys
36              mountPath: /host/sys
37              readOnly: true
38      volumes:
39        - name: proc
40          hostPath:
41            path: /proc              # Mounts the node's /proc filesystem
42        - name: sys
43          hostPath:
44            path: /sys               # Mounts the node's /sys filesystem

Targeting Specific Nodes — Not Always Every Node

You do not always want a DaemonSet on every node. A GPU monitoring agent should only run on GPU nodes:

YAML

1# nodeSelector — simple label match
2spec:
3  template:
4    spec:
5      nodeSelector:
6        accelerator: nvidia-gpu      # Only schedule on nodes labelled as GPU nodes

For more complex targeting, use nodeAffinity:

YAML

1# nodeAffinity — skip control-plane, run only on worker nodes
2affinity:
3  nodeAffinity:
4    requiredDuringSchedulingIgnoredDuringExecution:
5      nodeSelectorTerms:
6        - matchExpressions:
7            - key: node-role
8              operator: In
9              values:
10                - worker             # Excludes control-plane nodes explicitly

Tolerations — Getting onto Tainted Nodes

Control-plane nodes carry a default taint: node-role.kubernetes.io/control-plane:NoSchedule. Without a matching toleration, your DaemonSet will skip them. The operator: Exists toleration bypasses ALL taints — use it for critical agents that must run everywhere like Falco or CNI plugins.

YAML

1tolerations:
2  - key: node-role.kubernetes.io/control-plane
3    operator: Exists
4    effect: NoSchedule              # Specific — only bypass this one taint

Key DaemonSet Commands

Task	Command
List DaemonSet status	`kubectl get ds -n monitoring`
See which nodes have the pod	`kubectl get pods -o wide -n monitoring`
Check rollout status	`kubectl rollout status ds/node-exporter -n monitoring`
Force restart all pods	`kubectl rollout restart ds/node-exporter -n monitoring`
Describe DaemonSet events	`kubectl describe ds/node-exporter -n monitoring`
Check pod count vs node count	`kubectl get ds node-exporter -n monitoring`

Bash

1# Verify a DaemonSet pod is running on every node
2kubectl get pods -n monitoring -l app=node-exporter -o wide
3 
4# Output should show one pod per node:
5# NAME                  READY   NODE
6# node-exporter-4xk2p   1/1     mumbai-node-1
7# node-exporter-7hq9r   1/1     mumbai-node-2
8# node-exporter-m2pzn   1/1     mumbai-node-3

⚠️ Security: Setting hostNetwork: true and hostPID: true gives the container full visibility into the host's network stack and every process running on the node. Only use these flags for trusted infrastructure agents like Node Exporter or Falco — never for application workloads. In Hotstar or PhonePe production clusters, DaemonSet pods with host access should be reviewed as part of every security audit.

📌 Remember: DaemonSet pod count equals node count. If your cluster has 12 nodes and your DaemonSet shows 10 pods, two nodes have issues — either they are tainted without a matching toleration, or the pods are failing on those nodes. Use kubectl get pods -o wide to identify which nodes are missing coverage.

🔴 Common Mistake: Using a Deployment with replicas matching your node count as a substitute for a DaemonSet. If a node is added later, the Deployment will not automatically place a pod on it — you are back to manual scaling. Always use a DaemonSet for anything that must run on every node.

💡 Tip: In Zerodha or Swiggy-scale clusters, DaemonSets for log collection (Fluentd/Promtail) are often the highest-volume pods in the cluster. Set proper resources.requests and resources.limits on them — an unthrottled Fluentd pod can consume enough CPU to starve application pods on the same node during log bursts.

Syncing Data

DaemonSet

Technical Explanation & Usage

DaemonSet — One Agent Per Node, Always

What is a DaemonSet in Simple Terms?

DaemonSet vs Deployment — The Key Difference

One Pod Per Node — How It Looks

When to Use a DaemonSet

A Real DaemonSet — Node Exporter for Prometheus

Targeting Specific Nodes — Not Always Every Node

Tolerations — Getting onto Tainted Nodes

Key DaemonSet Commands

Related Terms

Namespace

Pod

Deployment