The Newsletter of Record
for the Future of Now
Token Wisdom
No. 153
SAT · JUN 13, 2026
Subscribe
← The Lexicon
Technologies

Weak-to-Strong Generalization

The empirical question of whether a weaker supervisory model can reliably elicit aligned behavior from a stronger model it cannot fully evaluate

— defined in 150th Edition, Mar 10, 2026
1editions defined
Mar 2026first defined
Mar 2026most recent
Technologiescategory

How the definition evolved (2 versions)

Defined in (1)

150th EditionW10 · Mar 10, 2026