Starling LLM
Open-source model using AI feedback reinforcement learning for safer responses
Starling is an open-source language model developed by researchers at UC Berkeley using Reinforcement Learning from AI Feedback (RLAIF), in which an AI model, rather than human annotators, supplies the preference data used for alignment. Because AI-generated preference rankings are far cheaper to collect than human labels, this approach makes alignment training easier to scale. Starling achieved strong performance on chat benchmarks relative to its size and represents an important research direction for scalable AI alignment. AI alignment researchers, ML practitioners, and open-source AI enthusiasts use Starling to study and build on RLAIF techniques.
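The RLAIF pipeline described above can be sketched in a few lines. This is a toy illustration, not Starling's actual training code: `ai_judge_score` is a hypothetical stand-in for the AI feedback model (Starling's training used an LLM judge to rank candidate responses), and the heuristic it applies here is invented purely for demonstration.

```python
# Minimal RLAIF preference-data sketch. In a real pipeline, an LLM judge
# ranks candidate responses; here a toy heuristic plays that role.

def ai_judge_score(response: str) -> float:
    """Hypothetical stand-in for an AI feedback model: prefers longer,
    more detailed answers (invented heuristic for illustration only)."""
    return float(len(response.split()))

def build_preference_pairs(prompt: str, responses: list[str]) -> list[tuple[str, str]]:
    """Rank candidate responses with the AI judge and emit (chosen, rejected)
    pairs -- the format a reward model is trained on before RL fine-tuning."""
    ranked = sorted(responses, key=ai_judge_score, reverse=True)
    return [
        (ranked[i], ranked[j])
        for i in range(len(ranked))
        for j in range(i + 1, len(ranked))
    ]

pairs = build_preference_pairs(
    "How do I reset my password?",
    [
        "Click 'Forgot password' on the login page and follow the email link.",
        "Figure it out.",
    ],
)
print(pairs[0][0][:5])  # the chosen (higher-ranked) response comes first
```

The resulting (chosen, rejected) pairs would then train a reward model, whose scores guide RL fine-tuning of the policy model; swapping the human annotator for an AI judge is exactly what distinguishes RLAIF from RLHF.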
Key Features
- ✓ RLAIF alignment
- ✓ Open-source
- ✓ Efficient training
- ✓ Research model
- ✓ Chat capability
Quick Info
- Category: AI Infrastructure & MLOps
- Pricing: Free