int8 quantization #32

shindavid · 2023-01-30T16:17:58Z

Model quantization (using int8 values instead of floats for faster inference) seems to be all the rage these days. The Oracle Devs AlphaZero blog post series writes extensively about how this improved their inference throughput (by 4x, they claim).

We should experiment with this. I have minimal familiarity with this technique.
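For anyone else new to the technique, here is a minimal sketch of the core idea, assuming symmetric per-tensor quantization with a single scale factor. The function names `quantize_int8` and `dequantize_int8` are hypothetical, and this is not taken from the Oracle posts or from any framework. It only illustrates the float-to-int8 mapping and the rounding error it introduces; any actual speedup would come from running int8 kernels in the inference backend.

```python
from typing import List, Tuple


def quantize_int8(values: List[float]) -> Tuple[List[int], float]:
    """Symmetric per-tensor quantization: map floats to integers in [-127, 127]."""
    max_abs = max((abs(v) for v in values), default=0.0)
    # Fall back to a scale of 1.0 so an all-zero tensor does not divide by zero.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # Round to the nearest integer code, then clamp to the int8 range we use.
    quantized = [max(-127, min(127, round(v / scale))) for v in values]
    return quantized, scale


def dequantize_int8(quantized: List[int], scale: float) -> List[float]:
    """Recover approximate floats from the integer codes."""
    return [q * scale for q in quantized]


# Toy "weights" (made-up values, purely for illustration).
weights = [0.5, -1.27, 0.003, 1.0]
codes, scale = quantize_int8(weights)
restored = dequantize_int8(codes, scale)

# The round trip is lossy: each value can be off by up to about scale / 2.
max_error = max(abs(w - r) for w, r in zip(weights, restored))
print(codes, scale, max_error)
```

Small values such as `0.003` collapse to code `0` here, which is the kind of accuracy loss we would need to measure against any throughput gain.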

shindavid added learning improvement modeling and removed learning improvement labels Jan 30, 2023

shindavid mentioned this issue Apr 3, 2023

Increase number of convolutional filters in network heads #54

Closed
