paper-summary
Found 1 post tagged with "paper-summary".
Paper Summary: Constitutional AI - Training Harmless AI Without Human Labels
2026-01-30 · 3 min
Anthropic's Constitutional AI trains models to be harmless through self-critique and AI feedback, reducing reliance on human labelers while improving both safety and helpfulness.