Safety Pretraining: Toward the Next Generation of Safe AI
Pratyush Maini*
Sachin Goyal*
Dylan Sam*
Alex Robey
Yash Savani
Yiding Jiang
Andy Zou
Matt Fredrikson
Zachary C. Lipton
J. Zico Kolter
Carnegie Mellon University
DatologyAI
Center for AI Safety
Gray Swan AI
* Equal contribution
TL;DR: We embed safety directly into the pretraining pipeline with data‑centric interventions, delivering a 1.7B parameter model family that is natively safe. Everything (code, data & weights) is open‑source.