
CWAM_IC

Official PyTorch implementation of "Enhancing Learned Image Compression via Cross Window-based Attention", ISVC 2024.

Figure: Our framework

Acknowledgement

The framework is based on CompressAI. We add our model in compressai.models.ours and compressai.models.our_utils, and modify compressai.utils, compressai.zoo, compressai.layers, and examples/train.py for usage. Parts of the code benefit from The Devil Is in the Details: Window-based Attention for Image Compression, Video Frame Interpolation with Transformer, and Enhanced Invertible Encoding for Learned Image Compression.

Introduction

In this paper, we introduce a feature encoding and decoding module that improves CNNs' ability to handle complex data representations. The module combines dense blocks and convolutional layers to strengthen feature propagation and encourage feature reuse, and it is integrated in a residual manner. We then adopt a modular attention module that captures correlations among spatially neighboring elements while considering a wider receptive field; this component can be combined with CNNs to further enhance their performance.
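As a rough illustration of the dense-plus-residual design described above, here is a minimal PyTorch sketch of a residual dense block (layer counts, channel widths, and names are assumptions made for illustration; the actual modules live in compressai.models.ours and compressai.models.our_utils):

import torch
import torch.nn as nn

class ResidualDenseBlock(nn.Module):
    # Illustrative sketch: dense connections for feature reuse, fused back and
    # added residually to the input. Not the repository's exact module.
    def __init__(self, channels=64, growth=32, num_layers=4):
        super().__init__()
        self.layers = nn.ModuleList(
            nn.Sequential(
                nn.Conv2d(channels + i * growth, growth, kernel_size=3, padding=1),
                nn.LeakyReLU(0.2, inplace=True),
            )
            for i in range(num_layers)
        )
        # 1x1 fusion back to the input width so the block can be applied residually.
        self.fuse = nn.Conv2d(channels + num_layers * growth, channels, kernel_size=1)

    def forward(self, x):
        features = [x]
        for layer in self.layers:
            # Dense connectivity: each layer sees all previously produced feature maps.
            features.append(layer(torch.cat(features, dim=1)))
        return x + self.fuse(torch.cat(features, dim=1))

block = ResidualDenseBlock(channels=64)
print(block(torch.randn(1, 64, 32, 32)).shape)  # torch.Size([1, 64, 32, 32])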

[Paper]

Figure: Our results

Installation

As mentioned in CompressAI, "A C++17 compiler, a recent version of pip (19.0+), and common python packages are also required (see setup.py for the full list)."

git clone https://github.com/prmudgal/CWAM_IC_ISVC.git
cd CWAM_IC_ISVC/codes/
conda create -n cwamic python=3.7 
conda activate cwamic
pip install -U pip && pip install -e .
conda install -c conda-forge tensorboard
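A quick sanity check that the editable install resolves to the cloned source tree (run inside the cwamic environment; it simply imports the package and prints its location):

python -c "import compressai; print(compressai.__file__)"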

Evaluation

If you want to evaluate with a pretrained model, please download it from [Google drive] and put it in ./experiments/.

Pretrained Models

Pretrained models (optimized for MSE or MS-SSIM), trained from scratch on 300k randomly chosen images from the OpenImages dataset:

Loss     Lambda    Link
MSE      0.0045    mse_0045
MSE      0.00975   mse_00975
MSE      0.0175    mse_0175
MSE      0.0483    mse_0483
MSE      0.09      mse_09
MSE      0.14      mse_14
MS-SSIM  873       msssim_873
MS-SSIM  1664      msssim_1664
MS-SSIM  3184      msssim_3184
MS-SSIM  6050      msssim_6050

More trained models will be uploaded soon.
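To sanity-check a downloaded checkpoint before running evaluation, you can inspect it with plain PyTorch. The path below is a hypothetical example; adjust it to whatever file you placed under ./experiments/:

import torch

# Hypothetical path; the actual checkpoint file name may differ.
ckpt = torch.load("experiments/exp_01_mse_q1/checkpoint_best_loss.pth.tar", map_location="cpu")
keys = ckpt.keys() if isinstance(ckpt, dict) else []
print(list(keys)[:10])  # e.g. model / optimizer / epoch entries, depending on how it was saved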

Evaluation datasets can be downloaded from the Kodak dataset and CLIC.

Note that, as mentioned in the original CompressAI, "Inference on GPU is not recommended for the autoregressive models (the entropy coder is run sequentially on CPU)." So please run inference of our model on CPU.
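For reference, here is a minimal sketch of CPU-only inference through the CompressAI compress/decompress API. It uses the stock autoregressive mbt2018 model from compressai.zoo purely for illustration; evaluation of our model itself should go through the eval_model command below:

import torch
from compressai.zoo import mbt2018  # stock autoregressive model, used only for illustration

device = "cpu"  # the autoregressive entropy coder runs sequentially on CPU
net = mbt2018(quality=1, pretrained=True).eval().to(device)
net.update()  # make sure the entropy-coder CDF tables are built

x = torch.rand(1, 3, 256, 256, device=device)  # dummy image in [0, 1]
with torch.no_grad():
    enc = net.compress(x)  # {"strings": [...], "shape": ...}
    rec = net.decompress(enc["strings"], enc["shape"])["x_hat"]

bpp = sum(len(s[0]) for s in enc["strings"]) * 8.0 / (x.shape[2] * x.shape[3])
print(f"approx. bpp: {bpp:.3f}, reconstruction: {tuple(rec.shape)}")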

python -m compressai.utils.eval_model checkpoint $eval_data_dir -a invcompress -exp $exp_name -s $save_dir

An example: evaluate the quality 1 model, optimized with MSE, on the Kodak dataset.

python -m compressai.utils.eval_model checkpoint ../data/kodak -a invcompress -exp exp_01_mse_q1 -s ../results/exp_01

If you want to evaluate your trained model on your own data, please run the update step before evaluation. An example:

python -m compressai.utils.update_model -exp $exp_name -a invcompress
python -m compressai.utils.eval_model checkpoint $eval_data_dir -a invcompress -exp $exp_name -s $save_dir

Train

We use the training dataset processed in the repo and further preprocess it with /codes/scripts/flicker_process.py. The training settings are detailed in the paper. You can also use your own data for training.

python examples/train.py -exp $exp_name -m cwam -d $train_data_dir --epochs $epoch_num -lr $lr --batch-size $batch_size --cuda --gpu_id $gpu_id --lambda $lambda --metrics $metric --save

An example: train the quality 1 model optimized with the MSE metric.

python examples/train.py -exp exp_01_mse_q1 -m cwam -d ../data/flicker --epochs 600 -lr 1e-4 --batch-size 8 --cuda --gpu_id 0 --lambda 0.00475 --metrics mse --save 
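For reference, CompressAI-based training minimizes the standard rate-distortion objective, lambda * distortion + bits per pixel. The sketch below mirrors CompressAI's RateDistortionLoss for the MSE case and is only illustrative; the exact loss used by this repository's examples/train.py may differ:

import math
import torch
import torch.nn as nn

class RateDistortionLoss(nn.Module):
    # Illustrative sketch of the standard R-D loss: lambda * 255^2 * MSE + bpp.
    def __init__(self, lmbda=0.00475):
        super().__init__()
        self.lmbda = lmbda
        self.mse = nn.MSELoss()

    def forward(self, output, target):
        n, _, h, w = target.size()
        num_pixels = n * h * w
        # Rate term: total bits implied by the latent likelihoods, per pixel.
        bpp = sum(
            torch.log(likelihoods).sum() / (-math.log(2) * num_pixels)
            for likelihoods in output["likelihoods"].values()
        )
        # Distortion term: MSE scaled to the 0-255 range, as in CompressAI.
        mse = self.mse(output["x_hat"], target)
        return self.lmbda * 255 ** 2 * mse + bpp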

For other usage, please refer to the original CompressAI library.

Citation

If you find this work useful for your research, please cite:

@inproceedings{mudgal2024enhancing,
  title={Enhancing Learned Image Compression via Cross Window-Based Attention},
  author={Mudgal, Priyanka and Liu, Feng},
  booktitle={International Symposium on Visual Computing},
  pages={410--423},
  year={2024},
  organization={Springer}
}

Contact

Feel free to contact us if you have any questions ([email protected]).
