Skip to main content
👾

Gremlin

Chaos engineering platform for building resilient systems through fault injection

Code & Development
Gremlin logo

Gremlin

Chaos engineering platform for building resilient systems through fault injection

Gremlin is a chaos engineering platform that helps engineering teams proactively test system resilience by safely injecting failures—CPU spikes, network delays, instance terminations, and dependency outages—in controlled environments. Unlike running chaos experiments manually, Gremlin provides guided runbooks, blast radius controls that prevent experiments from cascading, and reliability scoring that tracks improvement over time. Gremlin is used by Netflix, Amazon, and Expedia to harden distributed systems against production failures.

Key Features

  • Fault injection
  • Blast radius controls
  • Guided runbooks
  • Reliability scoring
  • Kubernetes support
  • Scheduled attacks
#chaos-engineering#reliability#resilience#sre#kubernetes

Get Started

Visit Gremlin
🔵
Freemium
Free plan + paid upgrades

Quick Info

Category
Code & Development
Pricing
Freemium

More Code & Development Tools