🐙

OctoAI

AI compute service for running and customizing open-source models at scale

Code & Development

Freemium

OctoAI (now part of NVIDIA) is an AI compute service for running, customizing, and deploying open-source AI models, including Llama, Stable Diffusion, and Whisper, at production scale. The platform handles model optimization, hardware selection, and auto-scaling automatically, and its fine-tuning service adapts models to custom domains using small datasets. OctoAI's endpoint templates let teams deploy multi-model pipelines behind a single API.