-
Beihang University
- beijing
-
04:20
(UTC -12:00) - https://www.buaa.edu.cn/
Starred repositories
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
A collection of awesome video generation studies.
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation (AAAI 2025 Oral)
we want to create a repo to illustrate usage of transformers in chinese
总结梳理自然语言处理工程师(NLP)需要积累的各方面知识,包括面试题,各种基础知识,工程能力等等,提升核心竞争力
Reading list for research topics in multimodal machine learning
[ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion
✨✨Latest Advances on Multimodal Large Language Models
A collection of resources on controllable generation with text-to-image diffusion models.
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
Unofficial Implementation of Animate Anyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation
A simple and efficient Mamba implementation in pure PyTorch and MLX.
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
Open-Sora: Democratizing Efficient Video Production for All
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.