Gremlin
Chaos engineering platform for building resilient systems through fault injection
Gremlin
Chaos engineering platform for building resilient systems through fault injection
Gremlin is a chaos engineering platform that helps engineering teams proactively test system resilience by safely injecting failures—CPU spikes, network delays, instance terminations, and dependency outages—in controlled environments. Unlike running chaos experiments manually, Gremlin provides guided runbooks, blast radius controls that prevent experiments from cascading, and reliability scoring that tracks improvement over time. Gremlin is used by Netflix, Amazon, and Expedia to harden distributed systems against production failures.
Key Features
- ✓Fault injection
- ✓Blast radius controls
- ✓Guided runbooks
- ✓Reliability scoring
- ✓Kubernetes support
- ✓Scheduled attacks
Quick Info
- Category
- Code & Development
- Pricing
- Freemium
More Code & Development Tools
GitHub Copilot
Code & DevelopmentThe AI pair programmer trusted by millions of developers
Cursor
Code & DevelopmentThe code editor built around AI from the ground up
Tabnine
Code & DevelopmentPrivacy-first AI code completion
Codeium
Code & DevelopmentFree AI coding assistant with no usage limits