AI on Bare Metal
Subscribe
Sign in
Effectual LLM inference on Intel CPUs
Francesco Baldassarri
Apr 21, 2024
INT4 weight-only quantization inference with Neural Speed
Read →
Comments
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts
Effectual LLM inference on Intel CPUs
INT4 weight-only quantization inference with Neural Speed