alexunderch

Follow

😇

Sacha Chernyavskiy alexunderch

😇

Follow

19 followers · 307 following

Achievements

Achievements

Highlights

Pro

Lists (3)

Sort

🔮 Future ideas

✨ Inspiration

🚀 My stack

Starred repositories

nikitaved / einsum_decomp

A presentation explaining how Einsum could be understood and implemented.

2 Updated Dec 4, 2024

anyasims / edge-of-reach

Official implementation of Reach-Aware Value Estimation (RAVL) from the paper: "The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning."

Python 6 Updated May 22, 2024

aslansd / modified-ai-economist-gabm

The extended AI-Economist which will be compared with the extended Concordia, a generative agent-based model (GABM)

Jupyter Notebook 1 Updated Nov 21, 2024

facebookresearch / DeepRL-continuing-tasks

This repository contains the codebase for the research paper "An Empirical Study of Deep Reinforcement Learning in Continuing Tasks".

Python 3 Updated Nov 23, 2024

HDU-VRLab / Ground4Act

This repository contains the implementation of Ground4Act, a two-stage approach for collaborative pushing and grasping in clutter using a visual-language model.

23 Updated Oct 8, 2024

bytedance / jaqmc

JAX accelerated Quantum Monte Carlo

Python 58 8 Updated Dec 4, 2024

mohmdelsayed / streaming-drl

Deep reinforcement learning without experience replay, target networks, or batch updates.

Python 151 11 Updated Nov 30, 2024

instadeepai / qd-skill-discovery-benchmark

Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery

Python 13 2 Updated Jul 11, 2023

compsciencelab / amaro

Python 8 2 Updated Sep 20, 2024

lucas-maes / nano-simsiam

Minimalistic, hackable PyTorch implementation of SimSiam in ~400 lines. Achieves good performance on ImageNet with ResNet50. Features distributed training, real-time KNN eval, and AMP. Perfect for …

Python 19 3 Updated Nov 25, 2024

C16Mftang / place-cell-pred-coding

Learning grid cells by predictive coding

Python 4 Updated Sep 26, 2024

MichalBortkiewicz / JaxGCRL

Goal-Conditioned Reinforcement Learning with JAX

Jupyter Notebook 98 13 Updated Dec 2, 2024

kvfrans / jax-diffusion-transformer

Implementation of Diffusion Transformer (DiT) in JAX

Python 255 4 Updated Jun 11, 2024

cvxgrp / cvxpylayers

Differentiable convex optimization layers

Python 1,835 162 Updated Nov 16, 2024

BirkhoffG / jax-dataloader

Pytorch-like dataloaders in JAX.

Jupyter Notebook 63 3 Updated Oct 18, 2024

nnaisense / evotorch

Advanced evolutionary computation library built directly on top of PyTorch, created at NNAISENSE.

Python 1,023 63 Updated Nov 18, 2024

AIDC-AI / Marco-o1

An Open Large Reasoning Model for Real-World Solutions

Python 1,075 53 Updated Nov 28, 2024

allenai / open-instruct

Python 1,979 220 Updated Dec 5, 2024

flowersteam / vivarium

Multi-agent simulator in Jax for research and teaching in AI & ALife

Jupyter Notebook 11 Updated Dec 4, 2024

balrog-ai / BALROG

Benchmarking Agentic LLM and VLM Reasoning On Games

Python 57 12 Updated Dec 3, 2024

kuleshov-group / awesome-discrete-diffusion-models

A curated list for awesome discrete diffusion models resources.

164 6 Updated Nov 25, 2024

rl-tools / rl-tools

The Fastest Deep Reinforcement Learning Library

C++ 652 24 Updated Dec 4, 2024

PathPlanning / TP-SWAP

Source code and data for algorithms designed for decentralized anonymous/unlabeled multi-agent pathfinding (AMAPF)

Python 5 Updated Nov 18, 2024

lucidrains / dreamerv3-pytorch

Implementation of DreamerV3 in Pytorch

Python 37 1 Updated Nov 19, 2024

JuliaReinforcementLearning / ReinforcementLearning.jl

A reinforcement learning package for Julia

Julia 592 112 Updated Oct 18, 2024

FLAIROx / jafar

JAX reimplementation of the DeepMind paper "Genie: Generative Interactive Environments"

Python 39 4 Updated Nov 27, 2024

skylooop / ICL_Modular_Arithmetic

Forked from ablghtianyi/ICL_Modular_Arithmetic

Python 1 Updated Oct 31, 2024

Timothyxxx / WorldModelPapers

Paper collections of the continuous effort start from World Models.

144 6 Updated Jul 6, 2024

clement-bonnet / lpn

Latent Program Network (from the "Searching Latent Program Spaces" paper)

Jupyter Notebook 31 1 Updated Nov 28, 2024

gauthamvasan / avg

Action Value Gradient Algorithm

Python 15 Updated Nov 28, 2024

Starred topics

Bash