Spotlight

Autoscaling Hid Our LLM Cost Regression (85% → 4% Cache Hit Rate)

Nick Roan

This case study shows how a single RAG chunk size change collapsed vLLM prefix-cache hit rate from 85% to 4%, triggering an 80% GPU replica increase while latency stayed flat.

It also includes the fix: adding a two-phase cache replay gate in CI.

More articles →

Tools and utilities

  • Ingress NGINX Migration

    Official Traefik Labs CLI:

  • Trupositive

    Trupositive is a wrapper that automatically tags Terraform and CloudFormation resources with Git commit SHA, branch, and repository metadata for auditability and infrastructure traceability.

  • Crossview: Crossplane UI

    Crossview is a React-based dashboard for managing and monitoring Crossplane resources in Kubernetes with features like:

  • Teleskopio

    Teleskopio is a small, open-source Kubernetes web client that provides a clean browser interface for viewing and managing cluster resources without the weight of a full platform dashboard.

  • kubevirt-benchmark

    kubevirt-benchmark is a vendor-neutral performance testing toolkit for KubeVirt VMs on OpenShift or any Kubernetes distribution, covering VM provisioning, boot storms, live migration, chaos benchmarking, and failure recovery.

More projects →

Events starting soon

Discover more events onn Kube Events →

AI Agents Running Kubernetes
AI Agents Running Kubernetes

What happens when an AI agent stops generating Kubernetes YAML and starts operating the cluster directly?

Mike Solomon, software engineer at AIATELLA, explains how his team moved from a sprawling Helm setup to Markdown-driven infrastructure specs that Claude Code can execute, test, and refine.

You will learn

  • Why Helm became hard to maintain for a fast-moving medical infrastructure repo
  • How Claude debugged Argo, TLS conflicts, kubectl patches, and private registry credentials
  • How runbooks plus agent memory files capture failures so deployments become reproducible.

It is a practical look at where Kubernetes automation may be heading: less hand-written YAML, more precise intent, and a sharper definition of when the human must stay in the loop.

Learn from production

More case studies →

Matching jobs

    • Data Engineer with OXIO Corporation

    • Salary: $175.5K to $377.3K a year

    • Location: fully remote

    • Tech stack: Kubernetes, AWS, Go, Python, Scala, SQL, Snowflake, Kafka, Airflow, Spark

    • DevOps Engineer with Phonely

    • Salary: $67.5K to $539K a year

    • Location: based in the office in San Francisco, CA, USA

    • Tech stack: Kubernetes, AWS, GCP, ArgoCD, Python, Redis, PostgreSQL, Cloudformation, Pulumi, Terraform

    • DevOps Engineer with Rain Technologies Inc.

    • Salary: $47.97K to $242K a year

    • Location: based in the office in Lisbon, PT

    • Tech stack: Kubernetes, AWS, Helm, Python, Kafka, Terraform, Grafana, Prometheus

    • DevOps Engineer with Regard

    • Salary: $49.5K to $539K a year

    • Location: remote from

    • Tech stack: Kubernetes, AWS, ArgoCD, Docker, Python, Redis, PostgreSQL, Pulumi, Datadog

    • Software Engineer with OXIO Corporation

    • Salary: $9 to $533.5K a year

    • Location: remote from

    • Tech stack: Kubernetes, AWS, Docker, Java, Javascript, Kotlin, Swift, Typescript, Redis, PostgreSQL

Discover more Kubernetes jobs on Kube Careers →

Subscribe to Learn Kubernetes Weekly

Trusted by 77K engineers. Delivered 182 issues and counting.

or subscribe via

Build something

More tutorials →

Call for Papers closing soon

  1. 2

    days

    Devopsdays Kraków

    The Call For Paper is open until 10 May 2026 at GMT-4. More info →
    • Location: Kraków, PL

    • In-person conference organized by Devopsdays.

    • The conference starts on the 4 July 2026.

    • Apply here
  2. 6

    days

    code.talks

    The Call For Paper is open until 15 May 2026 at GMT-4. More info →
    • Location: Hamburg, DE

    • In-person conference organized by code.talks.

    • The conference starts on the 5 November 2026.

    • Apply here
  3. 7

    days

    Devopsdays Denver

    The Call For Paper is open until 16 May 2026 at GMT-4. More info →
    • Location: Denver, CO, USA

    • In-person conference organized by Devopsdays.

    • The conference starts on the 22 September 2026.

    • Apply here
  4. 7

    days

    Michigan Technology Conference 2026

    The Call For Paper is open until 16 May 2026 at GMT-4. More info →
    • Location: Rochester, MI, USA

    • In-person conference organized by The Michigan Technology Conference Association.

    • The conference starts on the 30 October 2026.

    • Apply here
  5. 8

    days

    TechEx North America

    The Call For Paper is open until 17 May 2026 at GMT-4. More info →
    • Location: San Jose, CA, USA

    • In-person conference organized by TechEx Events.

    • The conference starts on the 19 May 2026.

    • Apply here
  6. 9

    days

    Devopsdays London

    The Call For Paper is open until 17 May 2026 at GMT-4. More info →
    • Location: London, UK

    • In-person conference organized by Devopsdays.

    • The conference starts on the 17 September 2026.

    • Apply here
  7. 9

    days

    Devopsdays Rio de Janeiro

    The Call For Paper is open until 17 May 2026 at GMT-4. More info →
    • Location: Rio de Janeiro, BR

    • In-person conference organized by Devopsdays.

    • The conference starts on the 15 August 2026.

    • Apply here

Thanks to our sponsors who make Kube Today possible

Find out more about being a sponsor →

More articles

Even more articles →