Startup Subquadratic Claims Sparse-Attention LLM Cuts Costs and Boosts Speed
Executive Briefing
- Announces SubQ, a sparse-attention LLM purportedly 56 times faster than FlashAttention-based models in benchmarks
- Scores 89.7% on LiveCodeBench, placing it alongside top coding models from OpenAI, Google DeepMind, and Anthropic
- Claims dramatic cost reduction, citing $8 to run a large-dataset retrieval test that costs $2,600 on Anthropic's Opus 4.6
- Offers a 12-million-token context window, roughly 12 times larger than most current frontier models
- Third-party evaluator Appen validates core architectural claims, though SubQ remains unavailable for broad public testing
- Founders assert transformers could become obsolete within years if sparse-attention approaches gain wider adoption