Highlights
- Pro
Starred repositories
A curated list of awesome model based RL resources (continually updated)
The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation
TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.
multi-aspect feedback for improving reasoning chain quality
world modeling challenge for humanoid robots
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
llama3 implementation one matrix multiplication at a time
Minimal implementation of scalable rectified flow transformers, based on SD3's approach
IndicGenBench is a high-quality, multilingual, multi-way parallel benchmark for evaluating Large Language Models (LLMs) on 4 user-facing generation tasks across a diverse set 29 of Indic languages …
DSPy: The framework for programming—not prompting—language models
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective…
An open-source impl. of Large Reconstruction Models
Applying RL methods for autonomous driving in Carla simulator.
Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.
📖 A collection of object-compositional modeling by implicit neural representation.
NeRFshop: Interactive Editing of Neural Radiance Fields
[ICCV 2023] A latent space for stochastic diffusion models
collection of diffusion model papers categorized by their subareas
Official pytorch implementation of the paper: "An Edit Friendly DDPM Noise Space: Inversion and Manipulations". CVPR 2024.
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites