researchim-ai
Popular repositories Loading
-
-
simpleRL-reason
simpleRL-reason PublicForked from hkust-nlp/simpleRL-reason
This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
Python 1
-
RAGEN
RAGEN PublicForked from ZihanWang314/RAGEN
RAGEN is the first open-source reproduction of DeepSeek-R1 for training agentic models via reinforcement learning.
Python 1
-
reasoning-gym-ru
reasoning-gym-ru PublicForked from open-thought/reasoning-gym
procedural reasoning datasets
Python
-
-
verl
verl PublicForked from volcengine/verl
veRL: Volcano Engine Reinforcement Learning for LLM
Python
Repositories
- simpleRL-reason Public Forked from hkust-nlp/simpleRL-reason
This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
researchim-ai/simpleRL-reason’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…