linjh1118

🎯

Focusing

Jinghao Lin linjh1118

🎯

Focusing

MLLM Post-Training in MSRA (Previously Zhipu @THUDM @Tencent @baidu @telecom)

38 followers · 11 following

Achievements

Starred repositories

cxcscmu / deepresearch_benchmarking

Python 26 1 Updated Jul 29, 2025

m1heng / clawdbot-feishu

TypeScript 3,511 391 Updated Feb 26, 2026

Ayanami0730 / deep_research_bench

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Python 596 65 Updated Feb 16, 2026

vstorm-co / pydantic-deepagents

Python Deep Agent framework built on top of Pydantic-AI, designed to help you quickly build production-grade autonomous AI agents with planning, filesystem operations, subagent delegation, skills, …

Python 351 36 Updated Feb 24, 2026

OPPO-PersonalAI / Flash-Searcher

Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution

Python 68 5 Updated Dec 8, 2025

abhiz123 / todoist-mcp-server

MCP server for Todoist integration enabling natural language task management with Claude

JavaScript 368 73 Updated Apr 20, 2025

mangopy / Deep-Research-Survey

A Systematic Survey of Deep Research

305 15 Updated Jan 1, 2026

Buycar-arb / ToolForge

ToolForge: A Data Synthesis Pipeline for Multi-Hop Search without Real-World APIs

Python 10 Updated Jan 26, 2026

ADaM-BJTU / model-native-agentic-ai

Our survey's paper list on Agentic AI, continuously updated with the latest research.

88 5 Updated Oct 28, 2025

linjh1118 / AwesomeRM

Python 28 Updated Jan 11, 2026

wavetermdev / waveterm

An open-source, cross-platform terminal for seamless workflows

Go 17,527 781 Updated Feb 26, 2026

wzhwzhwzh0921 / Awesome_LRM_with_Entropy

Introduction about AWESOME_ENTROPY+LRM_PAPERS

30 1 Updated Dec 16, 2025

jingyaogong / minimind

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT！🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 39,938 4,828 Updated Feb 6, 2026

EMI-Group / evorl

EvoRL is a fully GPU-accelerated framework for Evolutionary Reinforcement Learning, implemented with JAX. It supports Reinforcement Learning (RL), Evolutionary Computation (EC), Evolution-guided Re…

Python 260 36 Updated Feb 11, 2026

THUDM / AgentRL

Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework

Python 223 14 Updated Jan 17, 2026

verl-project / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 19,384 3,291 Updated Feb 26, 2026

chengq1001 / RRHF-V

[COLING'25] RRHF-V: Ranking Responses to Mitigate Hallucinations in Multimodal Large Language Models with Human Feedback

Python 3 Updated Sep 28, 2025

THU-KEG / Agentic-Reward-Modeling

[ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems

Python 125 7 Updated Jun 11, 2025

Lx-Bao / EMRC

Python 4 Updated Aug 14, 2025

snap-stanford / Biomni

Biomni: a general-purpose biomedical AI agent

Python 2,693 475 Updated Feb 23, 2026

giteehubby / Fei_shoter

对飞书笔记进行截图，以及生成文案用于发小红书

Python 2 Updated Aug 14, 2025

microsoft / agent-lightning

The absolute trainer to light up AI agents.

Python 15,182 1,290 Updated Feb 11, 2026

dw-dengwei / daily-arXiv-ai-enhanced

Automatically crawl arXiv papers daily and summarize them using AI. Illustrating them using GitHub Pages.

JavaScript 2,385 829 Updated Feb 26, 2026

liyc-sys / LUFFY-POOL

带有双池子的混合策略强化学习，基于LUFFY开发

Python 1 Updated Aug 3, 2025

Sun-Haoyuan23 / Awesome-RL-based-Reasoning-MLLMs

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

1,354 60 Updated Dec 7, 2025