  • Peking University
  • Beijing


Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 15,678 1,443 Updated Sep 5, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 13,931 1,407 Updated Dec 25, 2024

Framework agnostic sliced/tiled inference + interactive ui + error analysis plots

Python 4,282 620 Updated Dec 18, 2024

The most powerful and modular diffusion model GUI, API and backend with a graph/nodes interface.

Python 66,173 7,073 Updated Feb 7, 2025

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 1,606 158 Updated Dec 21, 2024

Label Studio is a multi-type data labeling and annotation tool with standardized output format

JavaScript 20,620 2,527 Updated Feb 7, 2025

FastVideo is a lightweight framework for accelerating large video diffusion models.

Python 966 59 Updated Feb 7, 2025

"LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 11,576 1,606 Updated Feb 7, 2025

Fast and memory-efficient exact attention

Python 15,333 1,443 Updated Feb 4, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 2,473 147 Updated Feb 7, 2025

✨✨Latest Advances on Multimodal Large Language Models

13,759 889 Updated Jan 28, 2025

Official Code for Stable Cascade

Jupyter Notebook 6,577 531 Updated Jul 25, 2024

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 18,180 1,303 Updated Jan 27, 2025

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,163 467 Updated Nov 6, 2024

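Chinese LLM capability evaluation leaderboard: currently covers 164 large models, including commercial models such as chatgpt, gpt-4o, Google gemini, Claude3.5, Baidu ERNIE Bot, Qwen, Baichuan, iFlytek Spark, SenseTime senseChat, and minimax, as well as open-source models such as deepseek-v3, qwen2.5, llama3.3, phi-4, glm4, and internLM2.5. It provides not only a capability score leaderboard but also the raw outputs of all models!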

3,410 153 Updated Jan 29, 2025

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 138,754 27,850 Updated Feb 7, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,318 2,345 Updated Aug 12, 2024

"ChatGPT Principles and Practice: Algorithms, Techniques, and Private Deployment of Large Language Models"

Python 346 66 Updated Dec 9, 2023

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 171,194 44,991 Updated Feb 7, 2025

Drag & drop UI to build your customized LLM flow

TypeScript 34,800 18,105 Updated Feb 6, 2025

WeChat SDK for Python

Python 3,969 815 Updated Feb 4, 2025

A community-maintained Python framework for creating mathematical animations.

Python 29,717 2,085 Updated Feb 4, 2025

openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 275+ supported cars.

Python 51,932 9,380 Updated Feb 7, 2025

Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.

TypeScript 27,765 2,556 Updated Feb 7, 2025

Tracking and collecting papers/projects/others related to Segment Anything.

1,572 132 Updated Feb 7, 2025

[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception

Python 502 31 Updated May 8, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,212 993 Updated Nov 18, 2024

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Python 4,470 416 Updated Aug 19, 2024

[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

Python 2,464 125 Updated Jul 19, 2024

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 7,320 739 Updated Aug 12, 2024