Code for the paper *MMTM: Multimodal Transfer Module for CNN Fusion*. This is a reimplementation of the original MMTM code to reproduce the results on the NTU RGB+D dataset in Table 5 of the paper.
If you use this code, please cite the paper:
```
@inproceedings{vaezi20mmtm,
  author = {Vaezi Joze, Hamid Reza and Shaban, Amirreza and Iuzzolino, Michael L. and Koishida, Kazuhito},
  booktitle = {Conference on Computer Vision and Pattern Recognition ({CVPR})},
  title = {MMTM: Multimodal Transfer Module for CNN Fusion},
  year = {2020}
}
```
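For orientation, the MMTM unit is a squeeze-and-excitation style block that exchanges information between two unimodal CNN streams. Below is a minimal PyTorch sketch written from the paper's description; class and variable names are ours, and details may differ from the implementation in this repository:

```python
import torch
import torch.nn as nn

class MMTM(nn.Module):
    """Sketch of a squeeze-and-excitation fusion block across two modalities."""

    def __init__(self, dim_a, dim_b, ratio=4):
        super().__init__()
        dim_joint = (dim_a + dim_b) // ratio
        self.fc_squeeze = nn.Linear(dim_a + dim_b, dim_joint)
        self.fc_a = nn.Linear(dim_joint, dim_a)
        self.fc_b = nn.Linear(dim_joint, dim_b)
        self.relu = nn.ReLU()
        self.sigmoid = nn.Sigmoid()

    def forward(self, a, b):
        # Squeeze: global average pool each modality over all non-channel dims.
        sa = a.flatten(2).mean(-1)  # (batch, dim_a)
        sb = b.flatten(2).mean(-1)  # (batch, dim_b)
        joint = self.relu(self.fc_squeeze(torch.cat([sa, sb], dim=1)))
        # Excitation: per-channel gates in (0, 2), one set per modality.
        ea = 2 * self.sigmoid(self.fc_a(joint))
        eb = 2 * self.sigmoid(self.fc_b(joint))
        # Broadcast the gates back over the spatial/temporal dimensions.
        ea = ea.view(*ea.shape, *([1] * (a.dim() - 2)))
        eb = eb.view(*eb.shape, *([1] * (b.dim() - 2)))
        return a * ea, b * eb
```

For example, fusing 128-channel skeleton features with 256-channel RGB features would use `MMTM(128, 256)`; the module returns both feature maps recalibrated channel-wise, so it can be dropped between corresponding stages of the two streams.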
This code has been tested on Ubuntu 16.04 with Python 3.8.3 and PyTorch 1.5.0.
- Install PyTorch.
- Install tqdm by running `pip install tqdm`.
- Install OpenCV by running `pip install opencv-python`.
- Install matplotlib by running `pip install matplotlib`.
- Install scikit-learn by running `pip install scikit-learn`.
- Clone this repository along with the MFAS submodule by running `git clone --recurse-submodules https://github.com/haamoon/mmtm.git`.
- Download and uncompress the checkpoints, and place them in the `ROOT/checkpoints` directory.
- Download the NTU RGB+D dataset.
- Copy all skeleton files to the `ROOT/NTU/nturgbd_skeletons/` directory.
- Convert all video clips to 256x256 resolution at 30 fps and copy them to the `ROOT/NTU/nturgbd_rgb/avi_256x256_30/` directory (a conversion sketch follows below).
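The repository does not prescribe a conversion tool; one way to do it is with ffmpeg. The sketch below assumes ffmpeg is installed and that the raw clips sit in a hypothetical `avi_orig` directory; adjust the source path to wherever you extracted the videos:

```python
# Re-encode every raw NTU RGB+D clip to 256x256 at 30 fps via ffmpeg.
import subprocess
from pathlib import Path

src = Path("ROOT/NTU/nturgbd_rgb/avi_orig")        # assumed raw-clip location
dst = Path("ROOT/NTU/nturgbd_rgb/avi_256x256_30")  # directory the code expects
dst.mkdir(parents=True, exist_ok=True)

for clip in sorted(src.glob("*.avi")):
    subprocess.run(
        ["ffmpeg", "-i", str(clip),
         "-vf", "scale=256:256",  # resize to 256x256
         "-r", "30",              # resample to 30 fps
         str(dst / clip.name)],
        check=True,
    )
```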
- Run `python main_mmtm_ntu.py --datadir ROOT/NTU --checkpointdir ROOT/checkpoints --test_cp fusion_mmtm_epoch_8_val_loss_0.1873.checkpoint --no_bad_skel`.
- Reduce the batch size if you run out of memory, e.g. `--batchsize 1`.
- Add `--use_dataparallel` to use multiple GPUs (illustrated below).
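The `--use_dataparallel` flag presumably corresponds to PyTorch's standard `nn.DataParallel` wrapper (the flag's exact behavior in this repository is an assumption); generically, that pattern looks like:

```python
import torch
import torch.nn as nn

net = nn.Linear(16, 4)          # stand-in for the fusion network
if torch.cuda.device_count() > 1:
    # Replicates the module on every visible GPU and splits each batch across them.
    net = nn.DataParallel(net)
```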
- Run `python main_mmtm_ntu.py --datadir ROOT/NTU --checkpointdir ROOT/checkpoints --train --ske_cp skeleton_32frames_85.24.checkpoint --rgb_cp rgb_8frames_83.91.checkpoint`.
- We trained the model with the `--batchsize 20 --use_dataparallel` options on 4 GPUs.