Starred repositories of linjh1118

TypeScript 3,511 391 Updated Feb 26, 2026

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Python 596 65 Updated Feb 16, 2026

Python Deep Agent framework built on top of Pydantic-AI, designed to help you quickly build production-grade autonomous AI agents with planning, filesystem operations, subagent delegation, skills, …

Python 351 36 Updated Feb 24, 2026

Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution

Python 68 5 Updated Dec 8, 2025

MCP server for Todoist integration enabling natural language task management with Claude

JavaScript 368 73 Updated Apr 20, 2025

A Systematic Survey of Deep Research

305 15 Updated Jan 1, 2026

ToolForge: A Data Synthesis Pipeline for Multi-Hop Search without Real-World APIs

Python 10 Updated Jan 26, 2026

Our survey's paper list on Agentic AI, continuously updated with the latest research.

88 5 Updated Oct 28, 2025
Python 28 Updated Jan 11, 2026

An open-source, cross-platform terminal for seamless workflows

Go 17,527 781 Updated Feb 26, 2026

Introduction about AWESOME_ENTROPY+LRM_PAPERS

30 1 Updated Dec 16, 2025

🚀🚀 [LLM] Train a 26M-parameter GPT completely from scratch in just 2 hours! 🌏

Python 39,938 4,828 Updated Feb 6, 2026

EvoRL is a fully GPU-accelerated framework for Evolutionary Reinforcement Learning, implemented with JAX. It supports Reinforcement Learning (RL), Evolutionary Computation (EC), Evolution-guided Re…

Python 260 36 Updated Feb 11, 2026

Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework

Python 223 14 Updated Jan 17, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 19,384 3,291 Updated Feb 26, 2026

[COLING'25] RRHF-V: Ranking Responses to Mitigate Hallucinations in Multimodal Large Language Models with Human Feedback

Python 3 Updated Sep 28, 2025

[ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems

Python 125 7 Updated Jun 11, 2025
Python 4 Updated Aug 14, 2025

Biomni: a general-purpose biomedical AI agent

Python 2,693 475 Updated Feb 23, 2026

Takes screenshots of Feishu notes and generates copy for posting on Xiaohongshu

Python 2 Updated Aug 14, 2025

The absolute trainer to light up AI agents.

Python 15,182 1,290 Updated Feb 11, 2026

Automatically crawls arXiv papers daily, summarizes them using AI, and presents them via GitHub Pages.

JavaScript 2,385 829 Updated Feb 26, 2026

Mixed-policy reinforcement learning with dual pools, built on LUFFY

Python 1 Updated Aug 3, 2025

This repository provides a valuable reference for researchers in the multimodal field. Start your exploration of RL-based reasoning MLLMs here!

1,354 60 Updated Dec 7, 2025
Python 46 2 Updated Jul 1, 2025

Convert Markdown mathematical formulas in various formats to a format supported by Feishu (Lark).

Python 5 Updated Dec 29, 2025

A Curated Benchmark Repository for Medical Vision-Language Models

181 15 Updated Jan 21, 2026

MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.

Python 400 21 Updated Aug 26, 2025