
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 20,021 1,673 Updated Nov 26, 2025

Lightweight coding agent that runs in your terminal

Rust 54,666 6,954 Updated Dec 25, 2025

ModelScope: bring the notion of Model-as-a-Service to life.

Python 8,594 894 Updated Dec 23, 2025

Analyzing Hacker News discussions from a decade ago in hindsight with LLMs

Python 501 49 Updated Dec 10, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,687 76 Updated May 11, 2025

MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers

Python 413 46 Updated Oct 7, 2025

Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model

Python 916 57 Updated Dec 23, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,428 231 Updated Nov 12, 2025

MMaDA - Open-Sourced Multimodal Large Diffusion Language Models

Python 1,542 78 Updated Nov 16, 2025

Official Implementation of "MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation"

Python 281 7 Updated Nov 19, 2025

Cosmos-Reason1 models understand the physical common sense and generate appropriate embodied decisions in natural language through long chain-of-thought reasoning processes.

Python 863 74 Updated Dec 20, 2025

NanoGPT (124M) in 3 minutes

Python 4,007 532 Updated Dec 25, 2025

GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Python 2,090 141 Updated Dec 18, 2025

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 12,774 1,180 Updated Sep 26, 2025

An open-source AI agent that lives in your terminal.

TypeScript 16,740 1,440 Updated Dec 25, 2025

🚀🚀 Efficient implementations of Native Sparse Attention

Python 1,043 12 Updated Sep 29, 2025

[CoRL 2025] TWIST: Teleoperated Whole-Body Imitation System

Python 637 63 Updated Nov 1, 2025

The best ChatGPT that $100 can buy.

Python 39,261 4,978 Updated Dec 23, 2025

Official repository for the Boltz biomolecular interaction models

Python 3,555 708 Updated Oct 3, 2025

Training API and CLI

Python 271 29 Updated Dec 15, 2025

Model Merging with Functional Dual Anchors

Python 44 3 Updated Nov 23, 2025

NEO Series: Native Vision-Language Models from First Principles

Python 598 20 Updated Dec 17, 2025

Large multi-modal models (L3M) pre-training.

Python 223 13 Updated Sep 22, 2025

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,445 434 Updated Oct 27, 2025

Reproducing R1 for Code with Reliable Rewards

Python 278 16 Updated May 5, 2025
Python 126 12 Updated Nov 24, 2025

Post-training with Tinker

Python 2,616 262 Updated Dec 25, 2025

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,164 193 Updated Oct 9, 2025

[NeurIPS 2025]《SD-VLM: Spatial Measuring and Understanding with Depth-encoded Vision Language Models》

Python 30 3 Updated Nov 14, 2025