Tarsier -- a family of large-scale video-language models, which is designed to generate high-quality video descriptions , together with good capability of general video understanding.

Python 161 9 Updated Dec 10, 2024

magic-research / PLLaVA

Official repository for the paper PLLaVA

Python 612 43 Updated Jul 28, 2024

THUDM / CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 9,706 918 Updated Dec 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

冯祥卫 xiangweifeng

Achievements

Achievements

Block or report xiangweifeng

✨ 重要

PKU-YuanGroup / Open-Sora-Plan

pixeli99 / SVD_Xtend

a-r-r-o-w / cogvideox-factory

THUDM / CogVLM2

aigc-apps / CogVideoX-Fun

aigc-apps / EasyAnimate

genmoai / mochi

bytedance / tarsier

magic-research / PLLaVA

THUDM / CogVideo