Cosmos is a world model development platform that consists of world foundation models, tokenizers, and a video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
Collection of AWESOME vision-language models for vision tasks
🚀🚀 "Large models": train a small 26M-parameter GPT entirely from scratch in just 2 hours! 🌏
🚀 "Large models": train a 27M-parameter vision multimodal VLM from scratch in just 3 hours! 🌏
Docker image for Ubuntu Desktop that supports hardware GPU-accelerated GUI apps. You can access the container over SSH or remote desktop, just like a cloud VM.
Tutorial for writing custom PyTorch C++/CUDA kernels, applied to volume rendering (NeRF)
Reimplementation of World Models (Ha and Schmidhuber, 2018) in PyTorch
Implementation of an RL-based agent that uses Q-learning to learn a policy for solving a 3x3x3 Rubik's Cube
Calibrate a camera with Zhang Zhengyou's method (both with and without lens distortion)
An open-source implementation of Large Reconstruction Models
An open source implementation of CLIP.
Flexible and powerful tensor operations for readable and reliable code (for PyTorch, JAX, TensorFlow, and others)
A collection of resources and papers on Diffusion Models
A generative world for general-purpose robotics & embodied AI learning.
Convert Markdown to a Zhihu-compatible format.
Step-by-step instructions for installing CUDA, cuDNN, TensorFlow, and PyTorch
Python bindings to the Point Cloud Library (PCL)
A modern Neovim configuration with batteries included for Python, Lua, C++, Markdown, LaTeX, and more...
Python Kalman filtering and optimal estimation library. Implements Kalman filter, particle filter, Extended Kalman filter, Unscented Kalman filter, g-h (alpha-beta), least squares, H Infinity, smoo…
Yet another PyTorch implementation of Stable Diffusion (probably easy to read)
Ongoing research on training Gaussian splatting at scale with a distributed system
Pure-Python interactive 3D Gaussian Splatting viewer for real-time editing and analysis.
3D plotting and mesh analysis through a streamlined interface for the Visualization Toolkit (VTK)
Python code to fuse multiple RGB-D images into a TSDF voxel volume.
Repository for the code used in the Medium articles about Python libraries for 3D analysis, visualization and manipulation of point clouds and meshes