xiangweifeng 🍊
  • Beijing

Stars

✨ Important

10 repositories

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Python 11,716 1,034 Updated Dec 13, 2024

Stable Video Diffusion Training Code and Extensions.

Python 622 62 Updated Jul 25, 2024

Memory optimized finetuning scripts for CogVideoX & Mochi using TorchAO and DeepSpeed

Python 512 48 Updated Dec 5, 2024

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,164 147 Updated Sep 3, 2024

📹 A more flexible CogVideoX that can generate videos at any resolution and creates videos from images.

Python 549 38 Updated Dec 13, 2024

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Python 1,582 113 Updated Dec 12, 2024

The best OSS video generation models

Python 2,432 247 Updated Dec 6, 2024

Tarsier: a family of large-scale video-language models designed to generate high-quality video descriptions, with strong general video understanding capability.

Python 161 9 Updated Dec 10, 2024

Official repository for the paper PLLaVA

Python 612 43 Updated Jul 28, 2024

Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 9,706 918 Updated Dec 13, 2024