Show HN: Steerling-8B, a language model that can explain any token it generates
We release Steerling-8B, an 8B-parameter causal diffusion language model that is interpretable by construction — its predictions are routed through concepts you can measure, audit, and contr...