Label-free-Network-Compression

Caffe implementation of "Learning Compression from Limited Unlabeled Data" (ECCV 2018). It quantizes full-precision deep neural networks to 4-bit weights (and 8-bit activations) using only 1K unlabeled images.

How to use?

Part I. Create Quantized Model and Prototxt
# Python 2.7
cd ./python
vim config.py                # set pycaffe_path / model_name / train_dataset path / val_dataset path for your environment
python weights_quan.py       # quantize weights to 4-bit
python renorm.py             # Batch-Norm re-normalization in CPU mode
python activations_quan.py   # quantize activations to 8-bit
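For intuition, the sketch below shows one way per-channel symmetric 4-bit weight quantization can be done with numpy. It is not the repo's weights_quan.py: the max/7 scale rule is an illustrative assumption (a scale chosen to minimize quantization error would be a natural alternative).

```python
import numpy as np

def quantize_weights_4bit(W):
    """W: (out_channels, ...) float array -> (alpha, W_hat) with W ~= alpha * W_hat."""
    flat = W.reshape(W.shape[0], -1)
    alpha = np.abs(flat).max(axis=1) / 7.0      # one scale per output channel
    alpha[alpha == 0] = 1.0                     # guard against all-zero channels
    W_hat = np.clip(np.round(flat / alpha[:, None]), -7, 7)  # 4-bit levels {-7, ..., 7}
    return alpha, W_hat.reshape(W.shape).astype(np.int8)

W = np.random.randn(64, 3, 3, 3).astype(np.float32)
alpha, W_hat = quantize_weights_4bit(W)
W_deq = alpha[:, None, None, None] * W_hat      # dequantized approximation of W
print("max abs quantization error:", np.abs(W - W_deq).max())
```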
Part II. Test on the Validation Set
  1. Add act_quantize.cpp and act_quantize.cu to your_caffe_root/src/caffe/layers/.
  2. Add act_quantize.hpp to your_caffe_root/include/caffe/layers/.
  3. make all -j2
  4. make pycaffe
  5. ./build/tools/caffe test --weights /your/BN_quantized_caffemodel/in/config.py --model /your/val_prototxt/in/config.py --gpu XX --iterations 1000 # val_batch_size = 50 by default (Line 10 in config.py)
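The act_quantize layer is what clamps and rounds activations at test time. As a rough picture of what an 8-bit activation quantizer does, here is a minimal numpy sketch, assuming unsigned fixed-point for non-negative (post-ReLU) activations; the bit split used by the repo's layer is an assumption here:

```python
import numpy as np

def quantize_activations_8bit(x, frac_bits=4):
    """Round non-negative activations to a 2**-frac_bits grid, clamped to 8 bits."""
    step = 2.0 ** -frac_bits
    q = np.clip(np.round(x / step), 0, 2 ** 8 - 1)   # 256 levels
    return q * step

x = np.maximum(np.random.randn(4, 8) * 4.0, 0.0)     # stand-in for ReLU outputs
print(quantize_activations_8bit(x))
```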
WARNING:

By default, renorm.py uses 1K images to update the Batch-Norm parameters. The memory consumption can be quite large for deep networks (>12 GB); you may edit Line 8 in config.py to alleviate this.
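Conceptually, renorm.py forwards the unlabeled images through the quantized network and re-estimates each Batch-Norm layer's mean and variance. A minimal sketch of that statistics update, processing activations in small chunks so memory stays bounded (a streaming merge in the style of Chan et al.; not the repo's exact code):

```python
import numpy as np

def renorm_bn_stats(batches):
    """Re-estimate per-channel BN mean/variance from an iterable of
    (N, C, H, W) activation batches, merging one chunk at a time."""
    count, mean, m2 = 0, None, None
    for batch in batches:
        x = batch.transpose(1, 0, 2, 3).reshape(batch.shape[1], -1)  # (C, N*H*W)
        n = x.shape[1]
        b_mean, b_var = x.mean(axis=1), x.var(axis=1)
        if mean is None:
            count, mean, m2 = n, b_mean, b_var * n
        else:
            delta = b_mean - mean                     # parallel-variance merge
            tot = count + n
            mean = mean + delta * (n / float(tot))
            m2 = m2 + b_var * n + delta ** 2 * (count * n / float(tot))
            count = tot
    return mean, m2 / count

batches = (np.random.randn(50, 16, 8, 8) for _ in range(20))  # ~1K fake images
mean, var = renorm_bn_stats(batches)
print(mean.shape, var.shape)
```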

Results:

| Models | Weights | Activations | Top-1 (%) | Top-5 (%) |
| ------ | ------- | ----------- | --------- | --------- |
| AlexNet-BN | 32-bit | 32-bit | 60.43 | 82.47 |
| ReNorm | 4-bit | 8-bit | 60.12 | 82.22 |
| ResNet-18 | 32-bit | 32-bit | 69.08 | 89.03 |
| ReNorm | 4-bit | 8-bit | 67.48 | 88.02 |
| ResNet-50 | 32-bit | 32-bit | 75.30 | 92.11 |
| ReNorm | 4-bit | 8-bit | 73.82 | 91.33 |
| VGG16-BN | 32-bit | 32-bit | 70.44 | 89.94 |
| ReNorm | 4-bit | 8-bit | 69.15 | 89.52 |
Details:
  1. We report 224x224 single-crop accuracy (crops taken from 256xN/Nx256 images) on the ImageNet validation set. BN parameters are updated using 1K randomly selected unlabeled training images.

  2. We quantize the first and last layers to 8-bit using a fixed-point quantizer (see the sketch below).
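As an illustration of item 2, here is a sketch of a signed 8-bit fixed-point quantizer; splitting the 8 bits between integer and fractional parts based on the weight range is an assumption, not necessarily the repo's exact rule:

```python
import numpy as np

def fixed_point_quantize(w, total_bits=8):
    """Signed fixed-point: spend enough bits on the integer part to cover
    max|w| (plus a sign bit), and the remainder on the fraction."""
    int_bits = max(int(np.ceil(np.log2(np.abs(w).max() + 1e-12))), 0) + 1
    step = 2.0 ** -(total_bits - int_bits)
    qmin, qmax = -(2 ** (total_bits - 1)), 2 ** (total_bits - 1) - 1
    return np.clip(np.round(w / step), qmin, qmax) * step

w = np.random.randn(96, 3, 11, 11) * 0.05   # e.g. a first-layer weight tensor
print("max abs error:", np.abs(w - fixed_point_quantize(w)).max())
```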

Network Structure:

We add a scale layer after each quantized convolution layer, i.e.

$W \approx \alpha \hat{W}$

$y = W * x + b \approx \alpha \, (\hat{W} * x) + b$

where $\hat{W}$ is the 4-bit quantized weight tensor and $\alpha$ is the floating-point scaling factor.
  • Blob[0] in Conv : $\hat{W}$ (the 4-bit quantized weights)
  • Blob[1] in Conv : $0$ (the bias is folded into the scale layer)
  • Blob[0] in newly-added scale layer : $\alpha$ (the per-channel scaling factor)
  • Blob[1] in newly-added scale layer : $b$ (the original bias)
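As a sanity check on this decomposition, a small numpy sketch (a 1x1 convolution written as a matrix product; the per-channel max/7 scale is an illustrative choice) shows that Conv with $\hat{W}$ followed by the scale layer reproduces the original layer up to quantization error:

```python
import numpy as np

rng = np.random.RandomState(0)
W = rng.randn(16, 32).astype(np.float32)        # (out_ch, in_ch): a 1x1 conv
b = rng.randn(16).astype(np.float32)
x = rng.randn(32, 100).astype(np.float32)       # 100 spatial positions

alpha = np.abs(W).max(axis=1) / 7.0             # per-output-channel scale
W_hat = np.clip(np.round(W / alpha[:, None]), -7, 7)

y_ref = W.dot(x) + b[:, None]                   # original convolution
y_conv = W_hat.dot(x)                           # Conv: Blob[0] = W_hat, Blob[1] = 0
y_quant = alpha[:, None] * y_conv + b[:, None]  # Scale: Blob[0] = alpha, Blob[1] = b
print("max relative error:", np.abs(y_ref - y_quant).max() / np.abs(y_ref).max())
```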

Reference:

@inproceedings{Label-free,
  author    = {Xiangyu He and
               Jian Cheng},
  title     = {Learning Compression from Limited Unlabeled Data},
  booktitle = {Computer Vision - {ECCV} 2018 - 15th European Conference, Munich,
               Germany, September 8-14, 2018, Proceedings, Part {I}},
  pages     = {778--795},
  year      = {2018}
}
