Skip to content
View jwgu's full-sized avatar

Block or report jwgu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR2025] A Physics-Informed Blur Learning Framework for Imaging Systems

MATLAB 13 2 Updated Mar 20, 2025

Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics

Python 89 7 Updated Mar 15, 2025

A suite of image and video neural tokenizers

Jupyter Notebook 1,606 73 Updated Feb 11, 2025

MoVQGAN - model for the image encoding and reconstruction

Jupyter Notebook 229 15 Updated Oct 31, 2023
Python 98 6 Updated Aug 16, 2024

SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning

Python 514 57 Updated Dec 6, 2024

Efficiently apply modification functions to RLDS/TFDS datasets.

Python 11 10 Updated Jun 19, 2024

Paper list in the survey paper: Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis

421 29 Updated Jan 23, 2025
Jupyter Notebook 354 23 Updated Sep 26, 2024

ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation

Python 721 77 Updated Feb 20, 2025

DROID Policy Learning and Evaluation

Python 176 14 Updated Dec 21, 2024

Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV 2022)

Python 282 19 Updated May 1, 2024

Blend Between Multiple Images in JupyterLab.

Jupyter Notebook 113 11 Updated Apr 1, 2025

A sample html to compare two videos with slider animation using

HTML 4 Updated Jan 27, 2023

Official inference repo for FLUX.1 models

Python 21,297 1,509 Updated Feb 6, 2025

Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA

Python 1,556 79 Updated Sep 25, 2024

A collection of my personal dotfiles

Lua 590 94 Updated Mar 25, 2025

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 819 44 Updated Mar 20, 2025

Official repository for "AM-RADIO: Reduce All Domains Into One"

Python 1,079 42 Updated Mar 27, 2025

The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)

Jupyter Notebook 866 93 Updated Jan 3, 2024

A PyTorch native library for large model training

Python 3,582 333 Updated Apr 12, 2025

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 2,530 320 Updated Mar 23, 2025

Deep Fourier Upsampling

Python 69 5 Updated Mar 26, 2024

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 3,995 351 Updated Aug 7, 2024

Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch

Python 1,125 88 Updated Dec 12, 2023

A minimal, but effective implementation of CLIP (Contrastive Language-Image Pretraining) in PyTorch

Jupyter Notebook 27 5 Updated Feb 14, 2024

Democratization of RT-2 "RT-2: New model translates vision and language into action"

Python 441 63 Updated Jul 26, 2024

[ICRA 2023] A Joint Modeling of Vision-Language-Action for Target-oriented Grasping in Clutter

Python 131 20 Updated Apr 4, 2025

My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"

Python 226 10 Updated Apr 4, 2025
Next
Showing results