Hacker Times

HomeNewBestShowAboutSearchTrends

Speculative KV coding: losslessly compressing KV cache by up to ~4×

fergusfinn.com