Starred repositories
A curated list of awesome model-based RL resources (continually updated)
The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation
TextGrad: Automatic "Differentiation" via Text -- using large language models to backpropagate textual gradients.
multi-aspect feedback for improving reasoning chain quality
world modeling challenge for humanoid robots
Get up and running with Llama 3.3, Phi 4, Gemma 2, and other large language models.
llama3 implementation one matrix multiplication at a time
Minimal implementation of scalable rectified flow transformers, based on SD3's approach
IndicGenBench is a high-quality, multilingual, multi-way parallel benchmark for evaluating Large Language Models (LLMs) on 4 user-facing generation tasks across a diverse set of 29 Indic languages …
DSPy: The framework for programming—not prompting—language models
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective…
An open-source impl. of Large Reconstruction Models
Applying RL methods for autonomous driving in the CARLA simulator.
Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.
📖 A collection of object-compositional modeling by implicit neural representation.
NeRFshop: Interactive Editing of Neural Radiance Fields
[ICCV 2023] A latent space for stochastic diffusion models
collection of diffusion model papers categorized by their subareas
Official pytorch implementation of the paper: "An Edit Friendly DDPM Noise Space: Inversion and Manipulations". CVPR 2024.
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites