Skip to content
View torphix's full-sized avatar

Highlights

  • Pro

Block or report torphix

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Riona 🌸 is built with Node.js and TypeScript 🛠️. Designed to run jobs 📸 effortlessly. Lightweight, efficient, and a work in progress 🚧—more features coming soon! 🌟

TypeScript 2,169 306 Updated Jan 16, 2025

Caption Markup Language

Python 19 1 Updated Jan 3, 2023

[arXiv 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control

363 7 Updated Jan 12, 2025

✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 1,939 137 Updated Jan 17, 2025

A Lightweight Recommendation System

Python 7,083 538 Updated Nov 8, 2023
JavaScript 7 3 Updated Dec 16, 2024

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

Python 6,587 684 Updated Oct 12, 2024

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 3,084 295 Updated Nov 5, 2024

Joint speech-language model - respond directly to audio!

Python 365 33 Updated Jul 1, 2024

Passport Visa API

TypeScript 30 2 Updated Nov 24, 2024
Python 146 8 Updated Sep 5, 2024

EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Python 3,439 393 Updated Dec 10, 2024

Efficient and Scalable Implementations of Clustering Algorithms using Pytorch.

Python 5 Updated Aug 22, 2024

Vector (and Scalar) Quantization, in Pytorch

Python 2,838 231 Updated Jan 10, 2025

Bring portraits to life!

Python 13,664 1,460 Updated Jan 1, 2025

The official code of our ICCV2023 work: Implicit Identity Representation Conditioned Memory Compensation Network for Talking Head video Generation

Python 249 22 Updated Oct 5, 2023

Simplest working implementation of Stylegan2, state of the art generative adversarial network, in Pytorch. Enabling everyone to experience disentanglement

Python 3,728 590 Updated Jan 12, 2025

[NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.

Python 270 7 Updated Jul 9, 2024

MARS5 speech model (TTS) from CAMB.AI

Jupyter Notebook 2,591 215 Updated Aug 1, 2024

A generative speech model for daily dialogue.

Python 33,751 3,661 Updated Jan 13, 2025

Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.

Python 1,213 77 Updated Nov 27, 2024

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python 8,148 1,072 Updated Sep 14, 2024

Command-line program to download videos from YouTube.com and other video sites

Python 133,688 10,176 Updated Jan 15, 2025

LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation

Python 464 18 Updated Nov 16, 2024

A fast multimodal LLM for real-time voice

Python 2,882 180 Updated Jan 14, 2025

Latte: Latent Diffusion Transformer for Video Generation.

Python 1,757 183 Updated Sep 28, 2024

⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation (AAAI 2025)

Python 519 39 Updated Jan 18, 2025

Locally Hierarchical Auto-Regressive Modeling for Image Generation (HQ-Transformer)

Jupyter Notebook 27 2 Updated Feb 14, 2024

[NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models

Jupyter Notebook 258 9 Updated Dec 4, 2024
Next