The Newsletter of Record
for the Future of Now
Token Wisdom
No. 153
SAT · JUN 13, 2026
Subscribe
← The Lexicon
Technical Terms

Sparse Attention

An attention mechanism that computes relationships between a subset of token positions rather than all pairs; reduces quadratic scaling cost of full attention while preserving most information for relevant contexts

— defined in 152th Edition, Mar 24, 2026
1editions defined
Mar 2026first defined
Mar 2026most recent
Technical Termscategory

Defined in (1)

152th EditionW12 · Mar 24, 2026