An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 1,703 110 Updated Dec 13, 2024

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 2,178 120 Updated Dec 13, 2024
Python 347 25 Updated Nov 26, 2024

A minimal and universal controller for FLUX.1.

Python 877 46 Updated Dec 10, 2024

Rust implementation of Ultralytics YOLOv8/v10 using ONNX (ort)

Rust 21 2 Updated Oct 7, 2024

Fast ML inference & training for Rust with ONNX Runtime

Rust 969 105 Updated Dec 11, 2024

Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"

Python 5,869 345 Updated Dec 5, 2024

AI-based tool for removing hard-coded subtitles and text-like watermarks from images and videos, producing subtitle-free and watermark-free files at the original resolution. Runs locally, with no third-party API required.

Python 4,758 622 Updated Oct 23, 2024

Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System

Python 2,198 179 Updated Dec 12, 2024

Try CoreML models on multiple images and videos easily and quickly

Swift 151 10 Updated Feb 11, 2024

Header-only C++ AES cipher library

C++ 197 44 Updated Jun 21, 2024

Training-free Regional Prompting for Diffusion Transformers 🔥

Python 450 19 Updated Nov 28, 2024

Free C++ class library of cryptographic schemes

C++ 4,930 1,515 Updated Aug 1, 2024

Official inference framework for 1-bit LLMs

C++ 12,320 859 Updated Nov 11, 2024

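🚀 Next Generation AI One-Stop Internationalization Solution. 🚀 Next-generation AI one-stop solution for B2B and B2C use, supporting OpenAI, Midjourney, Claude, iFlytek Spark, Stable Diffusion, DALL·E, ChatGLM, Tongyi Qianwen, Tencent Hunyuan, 360 Zhinao, Baichuan AI, Volcano Ark, New Bing, Gemini, Moonshot …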

TypeScript 7,486 965 Updated Dec 8, 2024

SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree

Jupyter Notebook 518 15 Updated Dec 9, 2024

Various AI scripts. Mostly Stable Diffusion stuff.

Python 3,600 387 Updated Nov 29, 2024

Official repository of In-Context LoRA for Diffusion Transformers

1,316 65 Updated Nov 17, 2024

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 5,161 400 Updated Dec 11, 2024

Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".

Python 843 55 Updated Oct 28, 2024

DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

Python 605 45 Updated Dec 5, 2024

Code for the paper Breaking reCAPTCHAv2 accepted at COMPSAC 2024

Python 261 41 Updated Oct 23, 2024

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 1,586 77 Updated Dec 10, 2024

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 14,971 2,956 Updated Dec 13, 2024

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 6,304 547 Updated Dec 8, 2024

[NeurIPS 2024] VFIMamba: Video Frame Interpolation with State Space Models

Python 69 7 Updated Sep 26, 2024