TriAttention: KV Cache Compression Enables 32B Models on Consumer GPUs

Tuesday, April 7, 2026