What Happens When You Push GPT to Think Clearly, Not Safely
Most alignment research focuses on how GPT sounds, not how it reasons. This post explores frame tagging: a method for surfacing the moral and epistemic lenses GPT uses, testing the structural coherence of its reasoning, and building a model that thinks clearly, not just fluently.