Skip to content
@researchim-ai

researchim-ai

Popular repositories Loading

  1. fast-start fast-start Public

    Быстро стартуем

    1 1

  2. simpleRL-reason simpleRL-reason Public

    Forked from hkust-nlp/simpleRL-reason

    This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

    Python 1

  3. RAGEN RAGEN Public

    Forked from ZihanWang314/RAGEN

    RAGEN is the first open-source reproduction of DeepSeek-R1 for training agentic models via reinforcement learning.

    Python 1

  4. reasoning-gym-ru reasoning-gym-ru Public

    Forked from open-thought/reasoning-gym

    procedural reasoning datasets

    Python

  5. state-of-ai state-of-ai Public

    Собираем и объясняем статьи

  6. verl verl Public

    Forked from volcengine/verl

    veRL: Volcano Engine Reinforcement Learning for LLM

    Python

Repositories

Showing 8 of 8 repositories
  • agents-samples Public

    Заготовки под агентов

    researchim-ai/agents-samples’s past year of commit activity
    0 0 0 0 Updated Feb 6, 2025
  • RAGEN Public Forked from ZihanWang314/RAGEN

    RAGEN is the first open-source reproduction of DeepSeek-R1 for training agentic models via reinforcement learning.

    researchim-ai/RAGEN’s past year of commit activity
    Python 1 Apache-2.0 44 0 0 Updated Feb 6, 2025
  • verl Public Forked from volcengine/verl

    veRL: Volcano Engine Reinforcement Learning for LLM

    researchim-ai/verl’s past year of commit activity
    Python 0 Apache-2.0 198 0 0 Updated Feb 6, 2025
  • s1 Public Forked from simplescaling/s1

    s1: Simple test-time scaling

    researchim-ai/s1’s past year of commit activity
    Python 0 356 0 0 Updated Feb 6, 2025
  • reasoning-gym-ru Public Forked from open-thought/reasoning-gym

    procedural reasoning datasets

    researchim-ai/reasoning-gym-ru’s past year of commit activity
    Python 0 Apache-2.0 34 0 0 Updated Feb 6, 2025
  • state-of-ai Public

    Собираем и объясняем статьи

    researchim-ai/state-of-ai’s past year of commit activity
    0 0 0 0 Updated Jan 28, 2025
  • simpleRL-reason Public Forked from hkust-nlp/simpleRL-reason

    This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

    researchim-ai/simpleRL-reason’s past year of commit activity
    Python 1 MIT 166 0 0 Updated Jan 28, 2025
  • fast-start Public

    Быстро стартуем

    researchim-ai/fast-start’s past year of commit activity
    1 1 0 0 Updated Jan 25, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…