-
University of Trento
- Trento, Italy
-
06:43
(UTC -12:00) - xusy2333.com
- @xusy2333
- in/shiyao-xu-784102173
Highlights
Lists (2)
Sort Name ascending (A-Z)
Stars
You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
[ ECCV 2024 ] MotionLCM: This repo is the official implementation of "MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model"
The official repo for the paper "VeCLIP: Improving CLIP Training via Visual-enriched Captions"
The official PyTorch implementation of the paper "Human Motion Diffusion Model"
(CVPR 2023) Pytorch implementation of “T2M-GPT: Generating Human Motion from Textual Descriptions with Discrete Representations”
Unofficial Implementation of Animate Anyone
AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目
Visualization code for SMPL body model Family
[CVPR 2024] Official Implementation of "Seamless Human Motion Composition with Blended Positional Encodings".
[BMVC 2024] Official Implementation of the paper guided attention for interpretable motion captioning
ECCV 2024: Controllable Motion Generation through Language Guided Pose Code Editing
The official PyTorch implementation of the paper "MotionGPT: Finetuned LLMs are General-Purpose Motion Generators"
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
MotionFix: Text-Driven 3D Human Motion Editing [SIGGRAPH ASIA 2024]
A suite of image and video neural tokenizers
Official PyTorch implementation of the paper "TEACH: Temporal Action Compositions for 3D Humans" [3DV 2022]
[NCA] Official implementation of the paper Motion2Language, Unsupervised learning of synchronized semantic motion segmentation
The Scene Language: Representing Scenes with Programs, Words, and Embeddings (arXiv preprint)
Downstream semantic segmentation evaluation of DGInStyle.
[ECCV 2024] DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control
[CVPR 2023] Executing your Commands via Motion Diffusion in Latent Space, a fast and high-quality motion diffusion model
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
[Arxiv-2024] MotionLLM: Understanding Human Behaviors from Human Motions and Videos