Hacker Next
Google's TurboQuant AI-compression algorithm can reduce LLM memory usage by 6x
(arstechnica.com)
18 points by gmays 1 day ago | 3 comments
redanddead 1 day ago
You'd think it'd be bigger news on hn
axiologist 1 day ago
See https://news.ycombinator.com/item?id=47513475 from two days ago.