Stars
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
Painter & SegGPT Series: Vision Foundation Models from BAAI
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
EndoGaussian: Real-time Gaussian Splatting for Dynamic Endoscopic Scene Reconstruction
[NeurIPS 2024] Classification Done Right for Vision-Language Pre-Training
Strong and Open Vision Language Assistant for Mobile Devices
Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.
Stable-Hair: Real-World Hair Transfer via Diffusion Model
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
[ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation
A curated publication list on open vocabulary semantic segmentation and related area (e.g. zero-shot semantic segmentation) resources..
📚 A collection of papers about Referring Image Segmentation.
[CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"
Collection of awesome parameter-efficient fine-tuning resources.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
[CVPR 2024] SAI3D: Segment Any Instance in 3D Scenes
The official implementation of 3DDFA_V3 in CVPR2024 (Highlight).
Official implementation of CVPR2024 paper "Enhance Image Classification via Inter-class Image Mixup with Diffusion Model""
The official implementation of SAGA (Segment Any 3D GAussians)
[ACM MM 2024] The official repo for "DreamLCM: Towards High-Quality Text-to-3D Generation via Latent Consisitency Model"
CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method
A plug-in for stable diffusion webUI. Users can perform region/object level image manipulation, including object addition, removal, and attribute modification.
[CVPR2024] Open-world Semantic Segmentation Including Class Similarity
High performance self-hosted photo and video management solution.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
☄️ AirPods desktop user experience enhancement program, for Windows and Linux (WIP)
Official Code for DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing (CVPR 2024)
The official repository of our CVPR2023 paper "FFHQ-UV: Normalized Facial UV-Texture Dataset for 3D Face Reconstruction".
🏡 Structure-from-Motion (SfM) and Multi-View Stereo (MVS)