
Taming Stable Diffusion for Lip Sync!

Python 1,878 216 Updated Jan 17, 2025

DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion

Python 1,155 68 Updated Dec 7, 2024

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,025 143 Updated Jan 17, 2025
Jupyter Notebook 136 6 Updated Jan 17, 2025

LTX-Video Support for ComfyUI

Python 638 41 Updated Dec 22, 2024

The best OSS video generation models

Python 2,723 280 Updated Jan 8, 2025

Run ComfyUI workflows on multiple local GPUs/networked machines.

Python 391 35 Updated May 22, 2024

A zero dependency web UI for any LLM backend, including KoboldCpp, OpenAI and AI Horde

HTML 95 47 Updated Jan 17, 2025

You can use EchoMimic in ComfyUI

Python 504 48 Updated Jan 16, 2025

SLOP Detector and analyzer based on dictionary for shareGPT JSON and text

Python 49 4 Updated Nov 2, 2024

LLM-powered multiagent persona simulation for imagination enhancement and business insights.

Python 5,274 424 Updated Jan 3, 2025

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

1,788 231 Updated Jun 6, 2024

The world's simplest facial recognition api for Python and the command line

Python 53,943 13,534 Updated Aug 21, 2024

SOTA Open Source TTS

Python 18,412 1,383 Updated Jan 17, 2025
Python 743 63 Updated Nov 11, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,073 619 Updated Jan 15, 2025

Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.

Python 975 54 Updated Jan 2, 2025

[CVPR 2023] Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation

Python 429 43 Updated May 29, 2023

PuLID native implementation for ComfyUI

Python 767 48 Updated Oct 5, 2024

Code of Pyramidal Flow Matching for Efficient Video Generative Modeling

Python 2,704 266 Updated Dec 21, 2024

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 8,989 1,192 Updated Jan 15, 2025

Fast inference engine for Transformer models

C++ 3,528 313 Updated Dec 18, 2024
Python 9,974 1,278 Updated Jan 17, 2025

[SIGGRAPH ASIA 2024 TCS] AnimateLCM: Computation-Efficient Personalized Style Video Generation without Personalized Video Data

Python 622 47 Updated Oct 22, 2024

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

Python 632 18 Updated Sep 18, 2024

Official code for VEnhancer: Generative Space-Time Enhancement for Video Generation

Python 489 27 Updated Sep 16, 2024
Python 67 2 Updated Nov 2, 2024

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Python 29,278 3,667 Updated Aug 6, 2024

Everything-Reactivity in ComfyUI (audio, MIDI, motion, proximity, and more).

Python 365 22 Updated Jan 13, 2025