The algorithm achieves up to an eight-times performance boost over unquantized keys on Nvidia H100 GPUs.
TurboQuant is aimed at reducing the size of the key-value cache, which Google likens to a “digital cheat sheet” that stores ...
Memory stocks declined Wednesday as investors reacted to Google’s announcement of TurboQuant, a new compression algorithm designed to reduce memory requirements for AI systems, even as the broader ...
Google thinks it's found the answer, and it doesn't require more or better hardware. Originally detailed in an April 2025 ...
Memory stocks fell Wednesday despite broader technology sector strength, with shares dropping after Google unveiled ...