trt_quantization

implicit quantization (PTQ) TensorRT example

How to Run

generate .onnx from timm model

pip install torch
pip install onnx
pip install timm
pip install cuda-python
pip install tensorrt
pip install onnx-simplifier

python onnx_export.py
// a file 'resnet18_cuda.onnx' will be generated in onnx directory.

prepare calibration datas

mkdir calib_data
// insert 3-500 calib datas for model (no need label)

build tensorrt model and run

python onnx2trt.py
// a file 'resnet18_int8.engine' will be generated in engine directory.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

trt_quantization

How to Run

Files

README.md

Latest commit

History

README.md

File metadata and controls

trt_quantization

How to Run