-
Stanford University
- Bay Area | NYC
- https://www.zhiz.dev/
- @zhizdev
Stars
Easily train a good VC model with voice data <= 10 mins!
the AI-native open-source embedding database
Annotated Flow Matching paper
[TPAMI 2023] SceneDreamer: Unbounded 3D Scene Generation from 2D Image Collections
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Official implementation of SyncTweedies: A General Generative Framework Based on Synchronized Diffusions (NeurIPS 2024)
⚡️ Firebase plugins for Capacitor. Supports Android, iOS and the Web.
📱 A template for your local-first Expo project: Bun, Expo 51, TypeScript, TailwindCSS, DrizzleORM, Sqlite, EAS, GitHub Actions, Env Vars, expo-router, react-hook-form.
[3DV 2025] LoopSplat: Loop Closure by Registering 3D Gaussian Splats
My starter templates for building apps with react native and expo
[NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models
Minimal implementation of scalable rectified flow transformers, based on SD3's approach
TorchCFM: a Conditional Flow Matching library
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
[IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention
Lora beYond Conventional methods, Other Rank adaptation Implementations for Stable diffusion.
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
Character Animation (AnimateAnyone, Face Reenactment)
Understand Human Behavior to Align True Needs
Official inference repo for FLUX.1 models
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Just a helper script for invoking kohya converter (and maybe a cheeky inferencer to check it worked okay)
WebUI extension for ControlNet
Generative Models by Stability AI
OpenMMLab Text Detection, Recognition and Understanding Toolbox