Starred repositories
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents
Python Deep Agent framework built on top of Pydantic-AI, designed to help you quickly build production-grade autonomous AI agents with planning, filesystem operations, subagent delegation, skills, …
Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution
MCP server for Todoist integration enabling natural language task management with Claude
ToolForge: A Data Synthesis Pipeline for Multi-Hop Search without Real-World APIs
Our survey's paper list on Agentic AI, continuously updated with the latest research.
An open-source, cross-platform terminal for seamless workflows
Introduction to AWESOME_ENTROPY+LRM_PAPERS
🚀🚀 Large model: train a small 26M-parameter GPT entirely from scratch in just 2 hours!
EvoRL is a fully GPU-accelerated framework for Evolutionary Reinforcement Learning, implemented with JAX. It supports Reinforcement Learning (RL), Evolutionary Computation (EC), Evolution-guided Re…
Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework
verl: Volcano Engine Reinforcement Learning for LLMs
[COLING'25] RRHF-V: Ranking Responses to Mitigate Hallucinations in Multimodal Large Language Models with Human Feedback
[ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems
Biomni: a general-purpose biomedical AI agent
The absolute trainer to light up AI agents.
Automatically crawl arXiv papers daily, summarize them using AI, and present the results on GitHub Pages.
This repository provides a valuable reference for researchers in the field of multimodality. Start your exploration of RL-based reasoning MLLMs here!
Convert Markdown mathematical formulas in various formats to a format supported by Feishu (Lark).
A Curated Benchmark Repository for Medical Vision-Language Models
MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.

