Welcome to Putting NeRF on a Diet Project! This project is the Pytorch, JAX/Flax based code implementation of this paper [Putting NeRF on a Diet : Ajay Jain, Matthew Tancik, Pieter Abbeel, Arxiv : https://arxiv.org/abs/2104.00677] The model generates the novel view synthesis redering (NeRF: Neural Radiances Field) with Fewshot learning scheme. The semantic loss use the pre-trained CLIP Vision Transformer embedding. This information can give a 2D supervision for 3D. The Diet NeRF result outperforms the original NeRF in 3D reconstruction and neural rendering with only few images.
We will also upload our project on the Hugging Face Hub Repository Also. https://huggingface.co/flax-community/putting-nerf-on-a-diet/
Our JAX/Flax implementation currently supports:
Platform | Single-Host GPU | Multi-Device TPU | ||
---|---|---|---|---|
Type | Single-Device | Multi-Device | Single-Host | Multi-Host |
Training | ||||
Evaluation |
- Streamlit Space Demo
You can check our Streamlit Space demo on following site ! With any input camera pose, we can render the novel view synthesis. https://huggingface.co/spaces/flax-community/DietNerf-Demo
- Colab Demo
Moreover, we prapare the colab ipython notebook for you. You need colab pro account for running our model on the colab(For memory issue) https://colab.research.google.com/drive/1etYeMTntw5mh3FvJv4Ubb7XUoTtt5J9G?usp=sharing
# Clone the repo
svn export https://github.com/google-research/google-research/trunk/jaxnerf
# Create a conda environment, note you can use python 3.6-3.8 as
# one of the dependencies (TensorFlow) hasn't supported python 3.9 yet.
conda create --name jaxnerf python=3.6.12; conda activate jaxnerf
# Prepare pip
conda install pip; pip install --upgrade pip
# Install requirements
pip install -r jaxnerf/requirements.txt
# [Optional] Install GPU and TPU support for Jax
# Remember to change cuda101 to your CUDA version, e.g. cuda110 for CUDA 11.0.
pip install --upgrade jax jaxlib==0.1.57+cuda101 -f https://storage.googleapis.com/jax-releases/jax_releases.html
# install flax and flax-transformer
pip install flax transformer[flax]
Download the datasets from the NeRF official Google Drive.
Please download the nerf_synthetic.zip
and unzip them
in the place you like. Let's assume they are placed under /tmp/jaxnerf/data/
.
You can check more detail explaination about DietNeRF on following Notion Report
- ๐๐ VEEEERY Detail DietNeRF Explaination Docs : https://www.notion.so/DietNeRF-Putting-NeRF-on-a-Diet-4aeddae95d054f1d91686f02bdb74745
Based on the principle that โa bulldozer is a bulldozer from any perspectiveโ, Our proposed DietNeRF supervises the radiance field from arbitrary poses (DietNeRF cameras). This is possible because we compute a semantic consistency loss in a feature space capturing high-level scene attributes, not in pixel space. We extract semantic representations of renderings using the CLIP Vision Transformer, then maximize similarity with representations of ground-truth views. In effect, we use prior knowledge about scene semantics learned by single-view 2D image encoders to constrain a 3D representation.
You can check detail information on the author's paper. Also, you can check the CLIP based semantic loss structure on the following image.
Our code used JAX/FLAX framework for implementation. So that it can achieve much speed up than other NeRF codes. At last, our code used hugging face, transformer, CLIP model library.
python -m train \
--data_dir=/PATH/TO/YOUR/SCENE/DATA \ % e.g., nerf_synthetic/lego
--train_dir=/PATH/TO/THE/PLACE/YOU/WANT/TO/SAVE/CHECKPOINTS \
--config=configs/CONFIG_YOU_LIKE
You can toggle the semantic loss by โuse_semantic_lossโ in configuration files.
DietNeRF has a strong capacity to generalise on novel and challenging views with EXTREMELY SMALL TRAINING SAMPLES!
We made aritificial occlusion on the right side of image (Only picked left side training poses). The reconstruction quality can be compared with this experiment. Diet NeRF shows better quailty than Original NeRF when It is occluded.
Diet NeRF | NeRF |
Diet NeRF | NeRF |
Teams | Members |
---|---|
Project Managing | Stella Yang To Watch Our Project Progress, Please Check Our Project Notion |
NeRF Team | Stella Yang, Alex Lau, Seunghyun Lee, Hyunkyu Kim, Haswanth Aekula, JaeYoung Chung |
CLIP Team | Seunghyun Lee, Sasikanth Kotti, Khali Sifullah , Sunghyun Kim |
Cloud TPU Team | Alex Lau, Aswin Pyakurel , JaeYoung Chung, Sunghyun Kim |
- Extremely Don't Sleep Contributors ๐คฃ : Seunghyun Lee, Alex Lau, Stella Yang, Haswanth Aekula
- Neural rendering with fewshot images
- Hugging face CLIP based semantic loss loop
- You can choose coarse mlp / coarse + fine mlp training
(coarse + fine is on the
main
branch / coarse is on thecoarse_only
branch)- coarse + fine : shows good geometric reconstruction
- coarse : shows good PSNR / SSIM result
- Make Video / GIF rendering result,
--generate_gif_only
arg can run fast rendering GIF. - Cleaning / refactoring the code
- Made multiple models / colab / space for nice demo
- Game Industry
- Augmented Reality Industry
- Virtual Reality Industry
- Graphics Industry
- Online shopping
- Metaverse
- Digital Twin
- Mapping / SLAM
This project is based on โJAX-NeRFโ.
@software{jaxnerf2020github,
author = {Boyang Deng and Jonathan T. Barron and Pratul P. Srinivasan},
title = {{JaxNeRF}: an efficient {JAX} implementation of {NeRF}},
url = {https://github.com/google-research/google-research/tree/master/jaxnerf},
version = {0.0},
year = {2020},
}
This project is based on โPutting NeRF on a Dietโ.
@misc{jain2021putting,
title={Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis},
author={Ajay Jain and Matthew Tancik and Pieter Abbeel},
year={2021},
eprint={2104.00677},
archivePrefix={arXiv},
primaryClass={cs.CV}
}
Our Project is started in the HuggingFace X GoogleAI (JAX) Community Week Event. https://discuss.huggingface.co/t/open-to-the-community-community-week-using-jax-flax-for-nlp-cv/7104
Thank you for our mentor Suraj and organizers in JAX/Flax Community Week! Our team grows up with this community learning experience. It was wonderful time!
Common Computer AI(https://comcom.ai/ko/) sponsored the multiple V100 GPUs for our project! Thank you so much for your support!