Stars
This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
vaibhav016 / gpt-neox
Forked from EleutherAI/gpt-neoxAn implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
Awesome In-Context RL: A curated list of In-Context Reinforcement Learning - - —
Deep and online learning with spiking neural networks in Python
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Muesli RL algorithm implementation (PyTorch) (LunarLander-v2)
Official repo for the paper titled "Learning by Competition of Self-Interested Reinforcement Learning Agents"
Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.
Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
[NeurIPS 2021 Spotlight] Aligning Pretraining for Detection via Object-Level Contrastive Learning
🐍 Geometric Computer Vision Library for Spatial AI
High-quality implementations of standard and SOTA methods on a variety of tasks.
Bayesian Deep Learning Benchmarks
[NeurIPS 2021] Data-Efficient Instance Generation from Instance Discrimination
(TPAMI2022) The ImageNet-S benchmark/method for large-scale unsupervised/semi-supervised semantic segmentation.
Whitening for Self-Supervised Representation Learning | Official repository
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
Taming Transformers for High-Resolution Image Synthesis
PyTorch package for the discrete VAE used for DALL·E.
Pitfalls of In-Domain Uncertainty Estimation and Ensembling in Deep Learning, ICLR 2020
Official code release for "GRAF: Generative Radiance Fields for 3D-Aware Image Synthesis"