Startup Subquadratic Claims Sparse-Attention LLM Cuts Costs and Boosts Speed

June 20, 2026

Source: https://www.technologyreview.com/2026/06/19/1139313/a-startup-claims-it-broke-through-a-bottleneck-thats-holding-back-llms/

Executive Briefing

Announces SubQ, a sparse-attention LLM purportedly 56 times faster than FlashAttention-based models in speed benchmarks
Scores 89.7% on LiveCodeBench, placing it alongside top coding models from OpenAI, Google DeepMind, and Anthropic
Claims dramatic cost reduction, citing $8 versus $2,600 to run the same large-dataset retrieval test against Anthropic's Opus 4.6
Offers a 12-million-token context window, roughly 12 times larger than most current frontier models
Third-party evaluator Appen validates core architectural claims, though SubQ remains unavailable for broad public testing
Founders assert transformers could become obsolete within years if sparse-attention approaches gain wider adoption

Share: Facebook LinkedIn Email

Sponsored

Daytona 116503, Automatic, Steel and Yellow Gold, 40

$27995.00

Machenike G3V2 Bluetooth Controller for Pc/Switch/Ios/Android, Hall Effect Joysticks, RGB Lighting Gaming Controller,2 Programmable Buttons,1000mah Battery With Charging Station, Pink

$56.24

PATBO, Panthera Bikini Top

$275.00

Pelican - Boat Intruder 12 - Jon Fishing Boat - 12 ft

$849.99

Startup Subquadratic Claims Sparse-Attention LLM Cuts Costs and Boosts Speed

Executive Briefing

More Stories

White House AI Export Ban on Anthropic's Mythos Echoes Failed Crypto Wars History

Norway Bans Generative AI for Elementary Students Over Learning Development Concerns

Figma CEO: Designers Should Push Beyond AI's 'Average' Creative Output