🎮
Work Hard and Play Harder
  • Tsinghua University
  • Beijing

Organizations

@iMoonLab

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 5,651 395 Updated Dec 13, 2024

A plugin from ECMWF/ai models, with models sourced from PuYun Large AI-based Meteorological Model in Macarbon (Hangzhou)

Python 10 Updated Nov 18, 2024
Python 5,393 892 Updated Dec 9, 2024
Python 1,725 124 Updated Nov 8, 2024

Official implementations for paper: Zero-shot Image Editing with Reference Imitation

Python 1,160 83 Updated Jun 15, 2024
Python 126 5 Updated Sep 29, 2024

Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding

Python 565 61 Updated Dec 10, 2024

Official repository for the paper PLLaVA

Python 612 43 Updated Jul 28, 2024

[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting

Python 5,740 669 Updated Sep 18, 2024
Python 219 16 Updated Apr 10, 2024

🎥 Python and OpenCV-based scene cut/transition detection program & library.

Python 3,378 406 Updated Dec 12, 2024

ChatGLM2-6B: An Open Bilingual Chat LLM

Python 15,737 1,855 Updated Jun 27, 2024

Better Aligning Text-to-Image Models with Human Preference. ICCV 2023

Python 269 9 Updated Jul 14, 2023

Image to prompt with BLIP and CLIP

Python 2,725 430 Updated May 15, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 35,855 4,167 Updated Dec 13, 2024

Let us control diffusion models!

Python 30,899 2,777 Updated Feb 25, 2024

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch

Python 11,179 1,088 Updated May 11, 2024

A collection of resources and papers on Diffusion Models

HTML 11,229 951 Updated Aug 1, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 26,454 3,360 Updated Jul 23, 2024

Code for robust monocular depth estimation described in "Ranftl et al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"

Python 4,575 635 Updated Aug 23, 2024

EVA Series: Visual Representation Fantasies from BAAI

Python 2,341 168 Updated Aug 1, 2024

2024 up-to-date list of DATASETS, CODEBASES and PAPERS on Multi-Task Learning (MTL), from Machine Learning perspective.

685 53 Updated Nov 4, 2024

CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet

Python 210 8 Updated Dec 16, 2022

General Vision Benchmark, GV-B, a project from OpenGVLab

Python 189 12 Updated Feb 23, 2022

Official repository for the General Robust Image Task (GRIT) Benchmark

Jupyter Notebook 50 7 Updated Mar 29, 2023

[ICCV 2023 & AAAI 2023] Binary Adapters & FacT, [Tech report] Convpass

Python 174 8 Updated Aug 1, 2023

Pytorch implementation for Semantic Segmentation/Scene Parsing on MIT ADE20K dataset

Python 4,962 1,101 Updated Jan 15, 2024

❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119

Python 1,063 92 Updated Sep 2, 2023