4 days ago
Tether Brings Google’s TurboQuant to Production, Unlocking Long-Context AI on Ev...
TLDR: TurboQuant compresses AI KV cache memory by up to five times with minimal impact on model quality. The upgrade enables lapto...
May 07, 2026
Tether Launches Medical AI Models That Run on Phones and Beat Larger Systems
TLDR: Tether 1.7B model beat Google’s MedGemma-4B by 11+ points despite being less than half the size. The 4B model cuts token out...
