Stars
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
Generative Models by Stability AI
Official inference repo for FLUX.1 models
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
Qwen2.5 is the large language model series developed by the Qwen team, Alibaba Cloud.
Official code for "GoMAvatar: Efficient Animatable Human Modeling from Monocular Video Using Gaussians-on-Mesh"
[CVPR 2024] The official repo for FlashAvatar
[CVPR 2024] Official implementation of SplattingAvatar.
State-of-the-art 2D and 3D Face Analysis Project
An open-source platform for developing protein models beyond AlphaFold.
Multilingual Voice Understanding Model
Multilingual large voice generation model with full-stack inference, training, and deployment capabilities.
I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
The repo provides information about the KeSpeech dataset.
A generative speech model for daily dialogue.
Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
Fit 3DMM to front and side face images simultaneously.
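Dive into Deep Learning: written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.
Python tools for 3D faces: 3DMM, mesh processing (transform, camera, light, render), 3D face representations.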
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction of AlphaFold 2