home / resources / blog / tag / kubernetes

Posts tagged kubernetes.

ZopDev writing tagged kubernetes. Engineering and FinOps notes, post-mortems, and benchmarks.

kubernetes

The IDP Adoption Problem: Why Most Platforms Fail

Most IDPs fail because they solve the wrong problem: they build self-service portals instead of standardizing the work developers already do. We measured this in production. Teams spend six months…

Muskan Bandta May 22 · 14 min

kubernetes

The Real Cost of Building an IDP: Breaking Down the $400k

Building an Internal Developer Platform for 12 teams costs $400,000 (Platform Engineering for 12 Teams: The $400k IDP Bill), and understanding where that money goes determines whether you build or…

Muskan Bandta May 20 · 12 min

cloudops

The Fargate Tax: Why Serverless Kubernetes Costs 38% More Past 200 vCPU-Hours

The Fargate Tax: Why Serverless Kubernetes Costs 38% More Past 200 vCPU-Hours Fargate is appealing because the pitch is clean: no AMI patching, no node group sizing, no cluster autoscaler tuning. You…

Bableen Kaur May 13 · 9 min

cloudops

Kubernetes MTTR: From 43 Minutes to 9 With Structured Runbooks

Kubernetes MTTR: From 43 Minutes to 9 With Structured Runbooks The median Kubernetes incident takes 43 minutes to resolve. Eight minutes of that is the actual fix. The other 35 minutes is engineers…

Riya Mittal May 13 · 9 min

cloudops

Event Readiness: A Wizard for Pre-Scaling Infrastructure Before a Launch

A product team announces a new feature on Tuesday at 10:00 AM Pacific. The marketing email goes out at 09:55. By 10:01 the load balancer is seeing 8 times its baseline request rate. The autoscaler is…

Muskan Bandta May 12 · 10 min

platform-engineering

ZopDay: Provisioning EKS, GKE, AKS, and a Managed Datastore in One 8-Step Wizard

A new engineer joins on Monday. By Friday they need their first production-grade EKS cluster running so they can deploy the service they were hired to build. They open the company's Terraform module.…

Muskan Bandta May 12 · 9 min

cloudops

Live Kubernetes Visibility: 21 Resource Pages and the Crashloop Overview

A 500-pod cluster has one pod that restarted three times in the last 10 minutes. The operator on call does not know which pod. returns 500 lines of and a handful of interleaved through them. Finding…

Muskan Bandta May 11 · 11 min

finops

Stop the App Before the Database: Dependency-Aware Shutdown for Non-Prod Environments

A team writes the cron job that shuts non-prod down at 8 PM. The cron runs three commands in parallel: scale the EKS Deployments to zero, pause the Aurora cluster, stop the ElastiCache Redis nodes.…

Aryan Mehrotra May 8 · 8 min

cloudops

Pod Scheduling for the Frugal: How We Cut EKS Node Cost 31% Without Touching a Workload

A right-sized EKS cluster should not run at 40 percent node utilization. The pods declare requests that sum to 78 percent of node capacity. The cluster autoscaler provisions nodes to fit those…

Bableen Kaur May 8 · 11 min

cloudops

Closed-Loop SRE for Kubernetes: Auto-Remediating Pod Crashloops Before the On-Call Pages

The 3am page is rarely about something that needs a human. The on-call gets paged at 03:14 because a pod has crashlooped four times in five minutes. They open Slack, look at the logs, see "OOMKilled"…

Muskan Bandta May 7 · 9 min

cloudops

Kubernetes CPU Throttling Is Lying to You: Why Container Limits Bleed Latency

The dashboard says CPU throttling is at 0.5%. The p99 latency on that container says 30% of requests just lost 80 milliseconds to scheduling delay. Both numbers are correct. They are measuring…

Riya Mittal May 6 · 10 min

kubernetes

eBPF Gives Kubernetes Full Network Visibility Without the Sidecar CPU Tax

Istio sidecars cost 0.5 vCPU per pod at idle. At 100 pods, you're paying for 50 idle vCPUs. eBPF moves observability into the kernel — one hook point per node, not per pod. Here's the architecture, the tools, and when you still need Envoy.

Riya Mittal Apr 27 · 9 min

platform-engineering

Golden Paths That Include Cost Guardrails: A Platform Engineering Playbook

Every service provisioned from a Backstage template starts with zero budget alerts, zero mandatory tags, and a dev environment that runs 24/7. The platform team didn't choose this — they just never added cost defaults to the template. Here's how to fix that.

Riya Mittal Apr 27 · 8 min

kubernetes

Kubernetes Admission Controllers Block Oversized Pods Before They Drain Your Budget

OPA Gatekeeper rejects a pod before it ever runs. Here is how to write admission policies that block oversized resource requests, missing cost labels, and non-prod images at deploy time, not billing time.

Bableen Kaur Apr 23 · 9 min

kubernetes

Kubernetes Network Policies Cut Egress Bills, Not Just Attack Surface

Unrestricted pod egress runs every outbound call through NAT Gateway at $0.045 per GB. NetworkPolicy is both a security control and a cost control. Here is how to use it as both.

Amanpreet Kaur Apr 22 · 10 min

kubernetes

Why Kubernetes Cluster Autoscaler Loses to Karpenter After 6 Months in Production

Cluster Autoscaler works on day one. Six months in, you have 12 node groups, 30% idle capacity, and scaling incidents during traffic ramps. Here is what changes when you switch to Karpenter.

Muskan Bandta Apr 21 · 11 min

kubernetes

Kubernetes Multi-Tenancy: Resource Quotas, Namespace Isolation, and the Cost of Getting It Wrong

Shared clusters without hard quotas become tragedy-of-the-commons cost problems. One team's memory leak becomes everyone's OOM. Here's how LimitRanges, ResourceQuotas, and namespace cost attribution fix that.

Amanpreet Kaur Apr 17 · 9 min

kubernetes

The Real Cost of a Service Mesh: Istio Sidecar Overhead in Production

Istio adds 50-100m CPU and 50-100Mi memory per pod at idle. At 100 pods that's 10 extra CPU cores. Here's the overhead math at scale, what you actually get for it, and when lighter alternatives make more sense.

Muskan Bandta Apr 17 · 8 min

kubernetes

Kubernetes VPA vs HPA vs KEDA: Which Autoscaler Actually Cuts Your Bill

The average Kubernetes cluster runs at 13% CPU utilization. VPA, HPA, and KEDA each attack the 87% idle gap differently — here's which one cuts your bill and which one creates production incidents.

Riya Mittal Apr 16 · 9 min

kubernetes

Policy-as-Code with OPA Gatekeeper: Stopping Cloud Waste Before It Deploys

SCPs block cloud-level overprovisioning but can't see inside a Kubernetes cluster. OPA Gatekeeper fills the admission control gap — blocking wasteful pod specs before they ever schedule.

Muskan Bandta Apr 16 · 8 min

kubernetes

Kubernetes Multi-Tenancy: Namespace Isolation, RBAC, and Network Policies Explained

Most teams running shared Kubernetes clusters believe they have isolation. They have namespaces. It feels like separation. It is not. Here's how to configure actual multi-tenancy.

Amanpreet Kaur Apr 13 · 8 min

apache-cassandra

Cassandra on Kubernetes: Where Distributed State Meets Distributed Control

Running Apache Cassandra on Kubernetes is an architectural commitment. Explore token rings, stateful identity, and operational risks in this technical guide.

Talvinder Singh Feb 20 · 4 min

cloud-infrastructure

PostgreSQL on Kubernetes: An Architectural Boundary, Not a Deployment Choice

Explore the mechanics of running PostgreSQL on Kubernetes. Learn about WAL, storage, and replication to manage operational risks and ensure database durability.

Talvinder Singh Feb 20 · 4 min

aiops

DevOps Trends to Watch in 2025

DevOps is evolving fast. Discover the top 12 trends shaping DevOps in 2025—from SRE and automation to AIOps and culture—and how your team can stay ahead.

Talvinder Singh Jun 13 · 5 min

cloud-native

Kubernetes Production Checklist: Building Robust, Scalable Cloud-Native Infrastructure

Ready to deploy Kubernetes in production? This comprehensive Kubernetes production checklist by Zopdev covers essential best practices for stability, observability, and cost efficiency.

Talvinder Singh May 30 · 5 min

automation

CI/CD Pipelines: A Deep Dive into Implementation Strategies

Explore advanced strategies for implementing CI/CD pipelines. Dive into pipeline architectures, branching models, automated testing, and deployment techniques for modern software delivery.

Talvinder Singh May 7 · 3 min

cloud-automation

Why does Kubernetes feel so complicated?

Kubernetes is powerful—but let’s face it, it often feels like a black box wrapped in YAML. This blog breaks down why Kubernetes feels so overwhelming and shows you how to simplify it using real-world tools like Terraform, GitOps, and automation platforms like Zopdev.

Talvinder Singh Apr 11 · 4 min

← Back to all posts

Get the weekly in your inbox.

One post a week. Sundays. No "10 ways to think about cloud" listicles, just the engineering and FinOps notes we'd want to read.

ZopNight

ZopDay

ZopCloud

The IDP Adoption Problem: Why Most Platforms Fail

Founded 2024.

Careers

Contact

Posts tagged kubernetes.

The IDP Adoption Problem: Why Most Platforms Fail

The Real Cost of Building an IDP: Breaking Down the $400k

The Fargate Tax: Why Serverless Kubernetes Costs 38% More Past 200 vCPU-Hours

Kubernetes MTTR: From 43 Minutes to 9 With Structured Runbooks

Event Readiness: A Wizard for Pre-Scaling Infrastructure Before a Launch

ZopDay: Provisioning EKS, GKE, AKS, and a Managed Datastore in One 8-Step Wizard

Live Kubernetes Visibility: 21 Resource Pages and the Crashloop Overview

Stop the App Before the Database: Dependency-Aware Shutdown for Non-Prod Environments

Pod Scheduling for the Frugal: How We Cut EKS Node Cost 31% Without Touching a Workload

Closed-Loop SRE for Kubernetes: Auto-Remediating Pod Crashloops Before the On-Call Pages

Kubernetes CPU Throttling Is Lying to You: Why Container Limits Bleed Latency

eBPF Gives Kubernetes Full Network Visibility Without the Sidecar CPU Tax

Golden Paths That Include Cost Guardrails: A Platform Engineering Playbook

Kubernetes Admission Controllers Block Oversized Pods Before They Drain Your Budget

Kubernetes Network Policies Cut Egress Bills, Not Just Attack Surface

Why Kubernetes Cluster Autoscaler Loses to Karpenter After 6 Months in Production

Kubernetes Multi-Tenancy: Resource Quotas, Namespace Isolation, and the Cost of Getting It Wrong

The Real Cost of a Service Mesh: Istio Sidecar Overhead in Production

Kubernetes VPA vs HPA vs KEDA: Which Autoscaler Actually Cuts Your Bill

Policy-as-Code with OPA Gatekeeper: Stopping Cloud Waste Before It Deploys

Kubernetes Multi-Tenancy: Namespace Isolation, RBAC, and Network Policies Explained

Cassandra on Kubernetes: Where Distributed State Meets Distributed Control

PostgreSQL on Kubernetes: An Architectural Boundary, Not a Deployment Choice

DevOps Trends to Watch in 2025

Kubernetes Production Checklist: Building Robust, Scalable Cloud-Native Infrastructure

CI/CD Pipelines: A Deep Dive into Implementation Strategies

Why does Kubernetes feel so complicated?

Get the weekly in your inbox.

Stop watching the waste.
Start cutting it.

Get the weekly in your inbox.

Stop watching the waste.Start cutting it.

Stop watching the waste.
Start cutting it.