-
University of Washington
- Seattle, US
-
13:10
(UTC -12:00) - https://rese1f.github.io/
- @wenhaocha1
- rese1f
- in/wenhao-chai-658274238
- https://scholar.google.com/citations?user=SL--7UMAAAAJ
Highlights
- Pro
-
arxiv-daily Public
Forked from beiyuouo/arxiv-dailyπ Automatically Update Some Fields Papers Daily using Github Actions (Update Every 12th hours)
-
-
MovieChat Public
[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
-
Apollo Public
Apollo is a family of LMMs designed for video understanding
-
MMAudio Public
Forked from hkchengrex/MMAudio[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
Python MIT License UpdatedDec 8, 2024 -
lobe-icons Public
Forked from lobehub/lobe-iconsπ₯¨ Lobe Icons - Popular AI / LLM Model Brand SVG Logo and Icon Collection.
TypeScript MIT License UpdatedNov 24, 2024 -
-
samurai Public
Forked from yangchris11/samuraiJupyter Notebook Apache License 2.0 UpdatedNov 20, 2024 -
StableV2V Public
Forked from AlonzoLeeeooo/StableV2VThe official implementation of the paper titled "StableV2V: Stablizing Shape Consistency in Video-to-Video Editing".
Python MIT License UpdatedNov 19, 2024 -
aurora Public
π₯ Aurora Series: A more efficient multimodal large language model series for video.
-
embodied-agent-interface.github.io Public
Forked from embodied-agent-interface/embodied-agent-interface.github.ioThis is the project website for the paper "Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making".
-
lmms-eval Public
Forked from EvolvingLMMs-Lab/lmms-evalAccelerating the development of large multimodal models (LMMs) with lmms-eval
Python Other UpdatedOct 26, 2024 -
google-scholar-badge Public
Forked from dexhunter/google-scholar-badgeTrack google scholar page and get your citation number on a badge
Python MIT License UpdatedSep 30, 2024 -
transformers Public
Forked from huggingface/transformersπ€ Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedSep 25, 2024 -
diffusers Public
Forked from huggingface/diffusersπ€ Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Python Apache License 2.0 UpdatedSep 7, 2024 -
xtuner Public
Forked from InternLM/xtunerAn efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Python Apache License 2.0 UpdatedAug 21, 2024 -
cambrian Public
Forked from cambrian-mllm/cambrianCambrian-1 is a family of multimodal LLMs with a vision-centric design.
Python Apache License 2.0 UpdatedAug 3, 2024 -
Awesome-VQVAE Public
A collection of resources and papers on Vector Quantized Variational Autoencoder (VQ-VAE) and its application
-
minisora Public
Forked from mini-sora/minisoraThe Mini Sora project aims to explore the implementation path and future development direction of Sora.
UpdatedFeb 28, 2024 -
Awesome-MLLM-Hallucination Public
Forked from showlab/Awesome-MLLM-Hallucinationπ A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).
UpdatedJan 18, 2024 -
STEVE Public
[ECCV 2024] STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment
-
DriveLM Public
Forked from OpenDriveLab/DriveLMDriveLM: Drive on Language
HTML Apache License 2.0 UpdatedDec 22, 2023 -
Awesome-LLM-3D Public
Forked from ActiveVisionLab/Awesome-LLM-3DAwesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources
-
old_web Public
Forked from desh2608/desh2608.github.iopersonal website built on beautiful jekyll, feel free to clone and modify
-
CityGen Public
ποΈππ Try Infinite and Controllable 3D City Layout Generation!
-
UniAP Public
[AAAI 2024] UniAP: Towards Universal Animal Perception in Vision via Few-shot Learning
-
ipl-uw.github.io Public
Forked from ipl-uw/ipl-uw.github.ioWebsite for IPL
HTML UpdatedNov 27, 2023 -
LLM-Agent-Paper-List Public
Forked from WooooDyy/LLM-Agent-Paper-ListThe paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
UpdatedNov 7, 2023 -
Awesome-DriveLM Public
π A collection of resources and papers on Large Language Models in autonomous driving
-
UniVHP Public
Unified Human-centric Perception Model and Benchmark in Sports