AI 发展日新月异, 以下项目是目自 2024-11-23 起, 搜集整理的非常棒的项目/应用/资源...
后面新添加, 都会标注日期:
CreateAI 2025-01-14
MiniMax-与用户共创智能 2025-01-14
MiniPerplx 2025-01-14
Project IDX 2025-01-14
Dify Marketplace 2025-01-14
即创 - 一站式智能创意生产与管理平台 2025-01-14
通义万相_AI创意作画_AI绘画_人工智能-阿里云 2025-01-14
Qwen 2025-01-14
VITA-MLLM/VITA: ✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction 2025-01-14
wileewang/TransPixar 2025-01-14
Stability-AI/stable-point-aware-3d 2025-01-14
SagiPolaczek/NeuralSVG: Official implementation of NerualSVG 2025-01-14
ali-vilab/TeaCache: Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model 2025-01-14
wangzhiyaoo/SVFR: Official implementation of SVFR. 2025-01-14
Ebook2audiobook V2.0 Beta - a Hugging Face Space by drewThomasson 2025-01-14
LatentSync - a Hugging Face Space by fffiloni 2025-01-14
SeedVR 2025-01-14
FaceLift: Single Image to 3D Head with View Generation and GS-LRM 2025-01-14
Deepseek Artifacts - Experience the power of the world's best open source model. 2025-01-05
TypingMind — LLM Frontend Chat UI for AI models 2025-01-05
STORM 2025-01-05
CAD Software for Hardware Design | Zoo 2025-01-05
让计算更简单 | OpenBayes 贝式计算 2025-01-05
bytedance/LatentSync: Taming Stable Diffusion for Lip Sync! 2025-01-05
TangoFlux - a Hugging Face Space by declare-lab 2025-01-05
Kokoro TTS - a Hugging Face Space by hexgrad 2025-01-05
AniPortrait Official - a Hugging Face Space by ZJYang 2025-01-05
VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control 2025-01-05
CodeElo 2025-01-05
Lovable 2025-01-02
Monica - ChatGPT AI Assistant | GPT-4o, Claude 3.5, Gemini 1.5 2025-01-02
Voicenotes: Transcribe notes, meetings & ask AI 2025-01-02
YouMind - AI Creation System 2025-01-02
FreedomIntelligence/HuatuoGPT-o1: Medical o1, Towards medical complex reasoning with LLMs 2025-01-02
vllm-project/vllm: A high-throughput and memory-efficient inference and serving engine for LLMs 2025-01-02
modelscope/DiffSynth-Studio: Enjoy the magic of Diffusion models! 2025-01-02
OpenDriveLab/AgiBot-World: World's First Large-scale High-quality Robotic Manipulation Benchmark 2025-01-02
TMElyralab/MusePose: MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation 2025-01-02
SpatialVision/Orient-Anything 2025-01-02
MMAudio — generating synchronized audio from video/text - a Hugging Face Space by hkchengrex 2025-01-02
Anychat - a Hugging Face Space by akhaliq 2025-01-02
FacePoke - a Hugging Face Space by jbilcke-hf 2025-01-02
AI Comic Factory - a Hugging Face Space by jbilcke-hf 2025-01-02
Switti - a Hugging Face Space by dbaranchuk 2025-01-02
Dokdo Multimodal - a Hugging Face Space by ginipick 2025-01-02
Dokdo - a Hugging Face Space by ginigen 2025-01-02
Feat2GS 2025-01-02
GenHMR: Generative Human Mesh Recovery 2025-01-02
1.58-bit FLUX 2025-01-02
HSfM 2025-01-02
PERSE: Personalized 3D Generative Avatars from A Single Portrait 2025-01-02
VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control 2025-01-02
Project Odyssey 2024-12-30
在线运行 ComfyUI 工作流并一键部署 API - ComfyOnline 2024-12-30
智谱AI开放平台 2024-12-30
Replit – Build apps and sites with AI 2024-12-30
AIGCPanel | 开源AI数字人系统 2024-12-30
阶跃星辰开放平台 2024-12-30
Fireworks - Fastest Inference for Generative AI 2024-12-30
百川大模型-汇聚世界知识 创作妙笔生花-百川智能 2024-12-30
DomoAI | AI Art Generator & Video to Animation Converter 2024-12-30
CreateAI 2024-12-30
Magnific AI — The magic image Upscaler & Enhancer 2024-12-30
Odyssey 2024-12-30
Nexa AI | Enterprise-Grade On-Device AI for Every Device 2024-12-30
Humane Ai Pin | See the World, Not Your Screen. | Humane 2024-12-30
书生 2024-12-30
Taipy — Build Python Data & BI web applications 2024-12-30
facebookresearch/blt: Code for BLT research paper 2024-12-30
VideoVerses/VideoVAEPlus 2024-12-30
hpcaitech/Open-Sora: Open-Sora: Democratizing Efficient Video Production for All 2024-12-30
osanseviero/geminiCoder: Create apps with Gemini 2024-12-30
IamCreateAI/Ruyi-Models 2024-12-30
rasbt/LLMs-from-scratch: Implement a ChatGPT-like LLM in PyTorch from scratch, step by step 2024-12-30
SakanaAI/asal: Automating the Search for Artificial Life with Foundation Models! 2024-12-30
mingyuan-zhang/LMM: Large Motion Model for Unified Multi-Modal Motion Generation 2024-12-30
TencentARC/StereoCrafter: A framework to convert any 2D videos to immersive stereoscopic 3D 2024-12-30
THUDM/CogAgent: An open-sourced end-to-end VLM-based GUI Agent 2024-12-30
AriaUI/Aria-UI: Aria-UI: Visual Grounding for GUI Instructions 2024-12-30
modstart-lib/aigcpanel: AigcPanel 是一个简单易用的一站式AI数字人系统,支持视频合成、声音合成、声音克隆,简化本地模型管理、一键导入和使用AI模型。 2024-12-30
krystalan/DRT-o1: DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought 2024-12-30
zsyOAOA/InvSR: Arbitrary-steps Image Super-resolution via Diffusion Inversion 2024-12-30
livekit/agents: Build real-time multimodal AI applications 🤖🎙️📹 2024-12-30
baaivision/See3D: You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale 2024-12-30
Nutlope/picMenu: Visualize menus in seconds with AI 2024-12-30
Avaiga/taipy: Turns Data and AI algorithms into production-ready web applications in no time. 2024-12-30
QVQ 72B Preview - a Hugging Face Space by Qwen 2024-12-30
LuminaBrush - a Hugging Face Space by lllyasviel 2024-12-30
InvSR - a Hugging Face Space by OAOA 2024-12-30
Lifting Motion to the 3D World via 2D Diffusion 2024-12-30
Synthesizing Moving People with 3D Control 2024-12-30
MegaSaM 2024-12-30
Sketch2Sound 2024-12-30
INFP: Audio-Driven Interactive Head Generation in Dyadic Conversations 2024-12-30
From Slow Bidirectional to Fast Causal Video Generators 2024-12-30
Zenodo 2024-12-20
Whisk 2024-12-20
labs.google/fx 2024-12-20
无问芯穹一站式AI平台 2024-12-20
VideoLingo - AI Subtitles Translation 2024-12-20
RedAIGC/Flux-version-LayerDiffuse 2024-12-20
microsoft/markitdown: Python tool for converting files and office documents to Markdown. 2024-12-20
franciszzj/Leffa: Learning Flow Fields in Attention for Controllable Person Image Generation 2024-12-20
wzhouxiff/ObjCtrl-2.5D: ObjCtrl-2.5D 2024-12-20
tumurzakov/AnimateDiff: AnimationDiff with train 2024-12-20
IamCreateAI/Ruyi-Models 2024-12-20
Genesis-Embodied-AI/Genesis: A generative world for general-purpose robotics & embodied AI learning. 2024-12-20
Kedreamix/Linly-Dubbing: 智能视频多语言AI配音/翻译工具 - Linly-Dubbing — “AI赋能,语言无界” 2024-12-20
genmoai/mochi: The best OSS video generation models 2024-12-20
guoyww/AnimateDiff: Official implementation of AnimateDiff. 2024-12-20
BrushEdit - a Hugging Face Space by TencentARC 2024-12-20
TRELLIS - a Hugging Face Space by JeffreyXiang 2024-12-20
Motion Prompting: Controlling Video Generation with Motion Trajectories 2024-12-20
snap-research.github.io/wonderland/ 2024-12-20
X-Portrait 2: Highly Expressive Portrait Animation 2024-12-20
New Chat | glhf.chat 2024-12-15
edify-3d Model by Shutterstock | NVIDIA NIM 2024-12-15
豆包 MarsCode - 工作台 2024-12-15
Devin 2024-12-15
DeepSeek - 探索未至之境 2024-12-15
Sora 2024-12-15
DeepLearning.AI - Learning Platform 2024-12-15
D5渲染器官网 | 实时光追渲染技术,重塑3D创作工作流 2024-12-15
PromptPerfect - AI Prompt Generator and Optimizer 2024-12-15
Learn Prompting: Your Guide to Communicating with AI 2024-12-15
hacksider/Deep-Live-Cam: real time face swap and one-click video deepfake with only a single image 2024-12-15
datawhalechina/llm-cookbook: 面向开发者的 LLM 入门教程,吴恩达大模型系列课程中文版 2024-12-15
f/awesome-chatgpt-prompts: This repo includes ChatGPT prompt curation to use ChatGPT better. 2024-12-15
Stability-AI/stable-audio-tools: Generative models for conditional audio generation 2024-12-15
lihxxx/DisPose: This repository is the official implementation of DisPose 2024-12-15
fkryan/gazelle 2024-12-15
tdrussell/diffusion-pipe: A pipeline parallel training script for diffusion models. 2024-12-15
openai/openai-cookbook: Examples and guides for using the OpenAI API 2024-12-15
FlowEdit - a Hugging Face Space by fallenshock 2024-12-15
Project Astra - Google DeepMind 2024-12-15
Project Mariner - Google DeepMind 2024-12-15
Jules (Confidential) 2024-12-15
MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance 2024-12-15
SwiftEdit 2024-12-15
Michael Fischer 2024-12-15
Using Diffusion Priors for Video Amodal Segmentation 2024-12-15
Fish Audio: Free Generative AI Text To Speech & Voice Cloning 2024-12-09
Generative Foundation Model - Amazon Nova - AWS 2024-12-09
RunComfy: Top ComfyUI Platform - Fast & Easy, No Setup 2024-12-09
提示工程指南 | Prompt Engineering Guide 2024-12-09
Prompt Engineering Guide | Prompt Engineering Guide 2024-12-09
Hailuo AI Audio: Create lifelike speech 2024-12-09
FunAudioLLM/SenseVoice: Multilingual Voice Understanding Model 2024-12-09
yformer/EfficientTAM: Efficient Track Anything 2024-12-09
jingyaogong/minimind: 「大模型」3小时完全从0训练26M的小参数GPT,个人显卡即可推理训练! 2024-12-09
kijai/ComfyUI-HunyuanVideoWrapper 2024-12-09
memoavatar/memo: Memory-Guided Diffusion for Expressive Talking Video Generation 2024-12-09
CosyVoice-300M · 创空间 2024-12-09
ChatTTS Speaker - a Hugging Face Space by taa 2024-12-09
Flux Fill Outpainting - a Hugging Face Space by multimodalart 2024-12-09
Flux.1-dev Upscaler - a Hugging Face Space by jasperai 2024-12-09
Flux.1-dev Upscaler - a Hugging Face Space by Nymbo 2024-12-09
Muse 2024-12-09
Introducing Veo and Imagen 3 on Vertex AI | Google Cloud Blog 2024-12-09
FLOAT 2024-12-09
Genie 2: A large-scale foundation world model - Google DeepMind 2024-12-09
Digital Life Project 2024-12-09
I2VControl: Disentangled and Unified Video Motion Synthesis Control 2024-12-09
DualPM: Dual Posed-Canonical Point Maps for 3D Shape and Pose Reconstruction 2024-12-09
fugatto.github.io 2024-12-09
CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models 2024-12-09
SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance 2024-12-09
vision-xl 2024-12-09
语鲸 2024-12-03
深言达意 – 找词找句 2024-12-03
爱校对官网-免费高效的错别字检查工具 2024-12-03
Learn About 2024-12-03
World Labs 2024-12-03
通义tongyi.ai_你的全能AI助手-通义千问 2024-12-03
天工AI - 搜索更深度,阅读更多彩 2024-12-03
讯飞星火大模型-AI大语言模型-星火大模型-科大讯飞 2024-12-03
文心一言 2024-12-03
Home • Hume AI 2024-12-03
Cohere | The leading AI platform for enterprise 2024-12-03
腾讯混元文生视频 2024-12-03
PixelDance - PixelDance AI - 领先的AI视频生成平台 2024-12-03
prs-eth/RollingDepth: Video Depth without Video Models 2024-12-03
Tencent/HunyuanVideo 2024-12-03
TryOffDiff - a Hugging Face Space by rizavelioglu 2024-12-03
Freditor 2024-12-03
纳米搜索 2024-12-02
书生·浦语 2024-12-02
Luma Dream Machine | AI Video Generator
VidAU Creative Center 2024-12-02
kaze.ai - AI-powered Free Online Removing Watermark and Logos Tool 2024-11-27
Hailuo AI Video Generator - Reimagine Video Creation
讲故事的方式发生了转变LTX工作室 --- Storytelling Transformed | LTX Studio
Genmo. Create videos and images with AI.
HeyGen - AI Spokesperson Video Creator
DomoAI: video to video, video to animation and more
Warpvideo AI: Change Video Style with AI
BoomCut - 爆剪辑 - 小影科技旗下 AI 内容创意产品与服务平台
Create stunning visuals in seconds with AI.
Remove Background from Image for Free – remove.bg
Logo-creator.io – Generate a logo
在线抠图软件_图片去除背景 | remove.bg – remove.bg
Meshy - Free 3D Models Generated from Images and Text
Immersity AI | Convert Image and Video to 3D
Free Text to Speech & AI Voice Generator | ElevenLabs
Udio AI Music Generator - Make Original Tracks in Seconds
在线免费文本转语音 - TTS-Online | 多种声音与二次元语音
Soundboard - TUNA - Download Unlimited Free Meme Sounds
Codeium · Free AI Code Completion & Chat 2024-12-02
NotebookLM | Note Taking & Research Assistant Powered by AI
LlamaOCR.com – Document to markdown
AnchorCrafter 2024-12-02
Generative Omnimatte: Learning to Decompose Video into Layers 2024-12-02
lehduong/OneDiffusion 2024-12-02
LipDub AI | The most realistic AI lip sync and video translation 2024-12-02
MyTimeMachine: Personalized Facial Age Transformation 2024-12-02
Buffer Anytime: Zero-Shot Video Depth and Normal from Image Priors 2024-12-02
MultiFoley 2024-12-02
Sonic: Shifting Focus to Global Audio Perception in Audio-driven Portrait Animation 2024-12-02
Fugatto, World’s Most Flexible Sound Machine, Debuts | NVIDIA Blog 2024-11-27
Fashion-VDM: Video Diffusion Model for Virtual Try-On
Inverse Painting: Reconstructing The Painting Process
首页 |剧集主管 --- Home | Showrunner
PersonaTalk: Bring Attention to Your Persona in Visual Dubbing
MarDini: Masked Auto-Regressive Diffusion for Video Generation at Scale -- Meta AI Research
loopyavatar.github.io/?ref=aihub.cn
URAvatar: Universal Relightable Gaussian Codec Avatars
Shakker - Generative AI design tool with diverse models
FREE online image generator and model hosting site! | Tensor.Art
Civitai: The Home of Open-Source Generative AI
FREE online image generator and model hosting site! | Tensor.Art
Cephalon Cloud 端脑云 - AIGC 应用平台
Discover and download free videos - Pixabay
AI工具集 | 700+ AI工具集合官网,国内外AI工具集导航大全
Supertools | Best AI Tools Guide
AIGC导航 | 1500+全品类AIGC创作工具_探索更多可能!
AI Model & API Providers Analysis | Artificial Analysis
Weird Wonderful AI Art | ART of the future - now!
hmrishavbandy/FlipSketch: FlipSketch: Flipping Static Drawings to Text-Guided Sketch Animations 2024-12-02
KwaiVGI/LivePortrait: Bring portraits to life! 2024-12-02
C0untFloyd/roop-unleashed: Evolved Fork of roop with Web Server and lots of additions 2024-12-02
jdh-algo/JoyVASA 2024-12-02
PKU-YuanGroup/ConsisID: Identity-Preserving Text-to-Video Generation by Frequency Decomposition 2024-12-02
facefusion/facefusion: Industry leading face manipulation platform 2024-11-27
GitHub - HVision-NKU/StoryDiffusion: Create Magic Story!
hpcaitech/Open-Sora: Open-Sora: Democratizing Efficient Video Production for All
hkchengrex/Cutie: [CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation
Tencent/MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Hillobar/Rope: GUI-focused roop
jy0205/Pyramid-Flow: Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
VectorSpaceLab/Video-XL: 🔥🔥First-ever hour scale video understanding models
anliyuan/Ultralight-Digital-Human: 一个超轻量级、可以在移动端实时运行的数字人模型
antgroup/echomimic_v2: EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
Zejun-Yang/AniPortrait: AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
HelloVision/HelloMeme: The official HelloMeme GitHub site
facebookresearch/sapiens: High-resolution models for human tasks.
genmoai/mochi: The best OSS video generation models
THUDM/CogVideo: text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Ji4chenLi/t2v-turbo: Code repository for T2V-Turbo and T2V-Turbo-v2
Lightricks/LTX-Video: Official repository for LTX-Video
sipie800/ComfyUI-PuLID-Flux-Enhanced 2024-12-02
EvilBT/ComfyUI_SLK_joy_caption_two: ComfyUI Node 2024-11-27
huchenlei/ComfyUI-layerdiffuse: Layer Diffuse custom nodes 2024-11-27
kijai/ComfyUI-IC-Light: Using IC-LIght models in ComfyUI 2024-11-27
kijai/ComfyUI-CogVideoXWrapper
Lightricks/ComfyUI-LTXVideo: LTX-Video Support for ComfyUI
smthemex/ComfyUI_EchoMimic: You can using EchoMimic in ComfyUI
logtd/ComfyUI-MochiEdit: ComfyUI nodes to edit videos using Genmo Mochi
kijai/ComfyUI-SUPIR: SUPIR upscaling wrapper for ComfyUI
HelloVision/ComfyUI_HelloMeme: Official comfyui repository of Hellomeme
alimama-creative/SDXL_EcomID_ComfyUI
Gourieff/comfyui-reactor-node: Fast and Simple Face Swap Extension Node for ComfyUI
kijai/ComfyUI-Florence2: Inference Microsoft Florence2 VLM
GiusTex/ComfyUI-DiffusersImageOutpaint: Diffusers Image Outpaint for ComfyUI
logtd/ComfyUI-Fluxtapoz: Nodes for image juxtaposition for Flux in ComfyUI
WASasquatch/was-node-suite-comfyui: An extensive node suite for ComfyUI with over 210 new nodes
ZHO-ZHO-ZHO/ComfyUI-InstantID: Unofficial implementation of InstantID for ComfyUI
kijai/ComfyUI-LivePortraitKJ: ComfyUI nodes for LivePortrait
PowerHouseMan/ComfyUI-AdvancedLivePortrait
TemryL/ComfyUI-IDM-VTON: ComfyUI adaptation of IDM-VTON for virtual try-on.
city96/ComfyUI-GGUF: GGUF Quantization support for native ComfyUI models
FizzleDorf/ComfyUI_FizzNodes: Custom Nodes for Comfyui
balazik/ComfyUI-PuLID-Flux: PuLID-Flux ComfyUI implementation
kijai/ComfyUI-PyramidFlowWrapper
erosDiffusion/ComfyUI-enricos-nodes: Compositor Node experiments
logtd/ComfyUI-Fluxtapoz: Nodes for image juxtaposition for Flux in ComfyUI
GreenLandisaLie/AuraSR-ComfyUI: ComfyUI implementation of AuraSR
taabata/ComfyCanvas: Canvas to use with ComfyUI
smthemex/ComfyUI_Sapiens: You can call Using Sapiens to get seg,normal,pose,depth,mask
1038lab/ComfyUI-RMBG: A ComfyUI node for removing image backgrounds using RMBG-2.0.
marduk191/ComfyUI-Fluxpromptenhancer: A Prompt Enhancer for flux.1 in ComfyUI
Lightricks/ComfyUI-LTXVideo: LTX-Video Support for ComfyUI
open-webui/open-webui: User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
continue-revolution/sd-webui-segment-anything: Segment Anything for Stable Diffusion WebUI
lllyasviel/stable-diffusion-webui-forge
aigc-apps/sd-webui-EasyPhoto: 📷 EasyPhoto | Your Smart AI Photo Generator.
THUDM/GLM-4-Voice: GLM-4-Voice | 端到端中英语音对话模型
oobabooga/text-generation-webui: A Gradio web UI for Large Language Models.
ollama/ollama: Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
SillyTavern/SillyTavern: LLM Frontend for Power Users.
InternLM/InternLM: Official release of InternLM2.5 base and chat models. 1M context support
hiyouga/LLaMA-Factory: Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024) 2024-12-02
cocktailpeanut/fluxgym: Dead simple FLUX LoRA training UI with LOW VRAM support
Nerogar/OneTrainer: OneTrainer is a one-stop solution for all your stable diffusion training needs.
chengyou-jia/ChatGen 2024-12-02
erwold/qwen2vl-flux 2024-11-27
Yuanshi9815/OminiControl: A minimal and universal controller for FLUX.1. 2024-11-27
lllyasviel/sd-forge-layerdiffuse: [WIP] Layer Diffusion for WebUI (via Forge) 2024-11-27
ali-vilab/ACE: All-round Creator and Editor
mit-han-lab/hart: HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
xinsir6/ControlNetPlus: ControlNet++: All-in-one ControlNet for image generations and editing!
Kwai-Kolors/Kolors: Kolors Team
Xiaojiu-z/Stable-Hair: Stable-Hair: Real-World Hair Transfer via Diffusion Model
black-forest-labs/flux: Official inference repo for FLUX.1 models
lllyasviel/Omost: Your image is almost there!
gligen/GLIGEN: Open-Set Grounded Text-to-Image Generation
lllyasviel/IC-Light: More relighting!
instantX-research/InstantID: InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
ali-vilab/In-Context-LoRA: Official repository of In-Context LoRA for Diffusion Transformers
mit-han-lab/nunchaku: SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
ChenyangSi/FreeU: FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)
Nutlope/logocreator: A free + OSS logo generator powered by Flux on Together AI
NVlabs/Sana: SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
JackAILab/ConsistentID: Customized ID Consistent for human
netease-youdao/EmotiVoice: EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
haidog-yaqub/EzAudio: High-quality Text-to-Audio Generation with Efficient Diffusion Transformer
2noise/ChatTTS: A generative speech model for daily dialogue.
fishaudio/fish-speech: Brand new TTS solution
VAST-AI-Research/TripoSR 2024-11-27
HengyiWang/spann3r: 3D Reconstruction with Spatial Memory
LC044/WeChatMsg: 提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手
gabrielchua/open-notebooklm: Convert any PDF into a podcast episode!
getomni-ai/zerox: Zero shot pdf OCR with gpt-4o-mini
opendatalab/PDF-Extract-Kit: A Comprehensive Toolkit for High-Quality PDF Content Extraction
Nutlope/llama-ocr: Document to Markdown OCR library with Llama 3.2 vision
showlab/ShowUI: Repository for ShowUI: One Vision-Language-Action Model for GUI Visual Agent 2024-12-02
turboderp/exllamav2: A fast inference library for running LLMs locally on modern consumer-class GPUs 2024-12-02
instructor-ai/instructor: structured outputs for llms 2024-12-02
Comprehensive Guide to Prompting Techniques - Instructor 2024-12-02
deepseek-ai/DeepSeek-VL: DeepSeek-VL: Towards Real-World Vision-Language Understanding
dynobo/normcap: OCR powered screen-capture tool to capture information instead of images
modelscope/DiffSynth-Studio: Enjoy the magic of Diffusion models!
abi/screenshot-to-code: Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
stackblitz/bolt.new: Prompt, run, edit, and deploy full-stack web applications
lean-dojo/LeanCopilot: LLMs as Copilots for Theorem Proving in Lean
GitHub - 3b1b/manim: Animation engine for explanatory math videos
GitHub - KindXiaoming/pykan: Kolmogorov Arnold Networks
FujiwaraChoki/MoneyPrinter: Automate Creation of YouTube Shorts using MoviePy.
harry0703/MoneyPrinterTurbo: 利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
idootop/mi-gpt: 🏠 将小爱音箱接入 ChatGPT 和豆包,改造成你的专属语音助手。
wan-h/awesome-digital-human-live2d: Awesome Digital Human
HqWu-HITCS/Awesome-Chinese-LLM: 整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
Hannibal046/Awesome-LLM: Awesome-LLM: a curated list of Large Language Model
excalidraw/excalidraw: Virtual whiteboard for sketching hand-drawn like diagrams
meltylabs/melty: Chat first code editor. To download the packaged app:
gpt-omni/mini-omni2: Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。
Qwen2vl Flux Mini Demo - a Hugging Face Space by Djrango 2024-12-02
IC Light V2-Vary - a Hugging Face Space by lllyasviel 2024-12-02
IllusionDiffusion - a Hugging Face Space by AP123 2024-12-02
ReplaceAnything - a Hugging Face Space by modelscope 2024-12-02
QwQ-32B-Preview - a Hugging Face Space by Qwen 2024-12-02
OminiControl - a Hugging Face Space by Yuanshi 2024-11-27
ACE-Chat - a Hugging Face Space by scepter-studio
MoGe - a Hugging Face Space by Ruicheng
EzAudio - a Hugging Face Space by OpenSound
NaturalSpeech3 FACodec - a Hugging Face Space by amphion
IDM VTON - a Hugging Face Space by yisol
AnimateDiff-Lightning - a Hugging Face Space by ByteDance
Omost - a Hugging Face Space by lllyasviel
CLIP Interrogator - a Hugging Face Space by pharmapsychotic
Pyramid Flow - a Hugging Face Space by Pyramid-Flow
Joy Caption Alpha Two - a Hugging Face Space by fancyfeast
IC Light V2 - a Hugging Face Space by lllyasviel
MaskGCT TTS Demo - a Hugging Face Space by amphion
OmniGen - a Hugging Face Space by Shitao
MotionCLR - a Hugging Face Space by EvanTHU
SeedEdit-APP-V1.0 - a Hugging Face Space by ByteDance
Framer - a Hugging Face Space by wwen1997
BRIA RMBG 2.0 - a Hugging Face Space by briaai
MinerU - a Hugging Face Space by opendatalab
Qwen Turbo 1M Demo - a Hugging Face Space by Qwen
DimensionX - a Hugging Face Space by fffiloni
PhotoMaker V2 - a Hugging Face Space by TencentARC
OOTDiffusion - a Hugging Face Space by levihsu
moondream2 - a Hugging Face Space by vikhyatk
使用 diffusers 训练你自己的 ControlNet 🧨
Stable Diffusion 3.5 Prompt Guide — Stability AI
使用 ChatGPT 进行写作的学生指南 |开放人工智能 --- A Student’s Guide to Writing with ChatGPT | OpenAI
richards199999/Thinking-Claude: Let your Claude able to think
hesamsheikh/ml-retreat: Machine Learning Journal for Intermediate to Advanced Topics.
Midjourney Documentation and User Guide
Disco Diffusion Portrait Study (by @enviraldesign) - Google 文档
alibaba/animate-anything: Fine-Grained Open Domain Image Animation with Motion Guidance
GitHub - prophesier/diff-svc: Singing Voice Conversion via diffusion model
TencentARC/GFPGAN: GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
guide to installing disco v5+ locally on windows
clip_interrogator.ipynb - Colaboratory
A Traveler’s Guide to the Latent Space
Disco Diffusion Illustrated Settings
Artist Studies by @remi_durant
CLIP Prompt Engineering for Generative Art - matthewmcateer.me