Skip to content
View zhaohengyuan1's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report zhaohengyuan1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
27 stars written in Jupyter Notebook
Clear filter

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 48,587 5,740 Updated Sep 18, 2024

Google Research

Jupyter Notebook 34,716 7,989 Updated Jan 18, 2025

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 27,035 3,411 Updated Jul 23, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 15,607 1,437 Updated Sep 5, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,178 991 Updated Nov 18, 2024

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image genera…

Jupyter Notebook 7,037 1,068 Updated Aug 6, 2024

Open-source and strong foundation image recognition models.

Jupyter Notebook 3,036 282 Updated Aug 1, 2024

Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".

Jupyter Notebook 1,107 91 Updated Dec 23, 2023

VOLO: Vision Outlooker for Visual Recognition

Jupyter Notebook 934 95 Updated Sep 18, 2022

Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

Jupyter Notebook 847 47 Updated Jan 20, 2025

[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web"

Jupyter Notebook 759 105 Updated Jul 30, 2024
Jupyter Notebook 713 151 Updated Apr 30, 2022

Official repository for the paper "High-Resolution Daytime Translation Without Domain Labels" (CVPR2020, Oral)

Jupyter Notebook 650 86 Updated Feb 15, 2023

[NeurIPS 2020] This project provides a strong single-stage baseline for Long-Tailed Classification, Detection, and Instance Segmentation (LVIS). It is also a PyTorch implementation of the NeurIPS 2…

Jupyter Notebook 562 71 Updated Aug 11, 2024

PyTorch implementation of MAML: https://arxiv.org/abs/1703.03400

Jupyter Notebook 555 126 Updated Oct 4, 2018

Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"

Jupyter Notebook 426 36 Updated Sep 5, 2023

MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts

Jupyter Notebook 264 42 Updated Nov 29, 2024

[NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale

Jupyter Notebook 185 21 Updated Nov 13, 2023
Jupyter Notebook 159 10 Updated Jul 5, 2024

[ICCV 2023] Code for "Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement"

Jupyter Notebook 142 12 Updated Apr 21, 2024

Code for the Paper: Antonino Furnari and Giovanni Maria Farinella. What Would You Expect? Anticipating Egocentric Actions with Rolling-Unrolling LSTMs and Modality Attention. International Conferen…

Jupyter Notebook 133 33 Updated Aug 23, 2023

[ECCV 2022] A generalized long-tailed challenge that incorporates both the conventional class-wise imbalance and the overlooked attribute-wise imbalance within each class. The proposed IFL together…

Jupyter Notebook 121 8 Updated Aug 11, 2024

[ICCV 2021 Oral + TPAMI] Just Ask: Learning to Answer Questions from Millions of Narrated Videos

Jupyter Notebook 118 15 Updated Sep 29, 2023

BotSIM - a data-efficient end-to-end Bot SIMulation toolkit for evaluation, diagnosis, and improvement of commercial chatbots

Jupyter Notebook 114 8 Updated Jun 12, 2023

Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models

Jupyter Notebook 79 5 Updated Sep 3, 2024

Temporally Consistent Video Colorization with Deep Feature Propagation and Self-regularization Learning

Jupyter Notebook 48 13 Updated Oct 28, 2024

FathomNet's out-of-sample detection challenge in association with FGVC 2023

Jupyter Notebook 8 1 Updated Apr 18, 2023