Lists (32)
Sort Name ascending (A-Z)
3D animation
3D EDITOR
3D Rendering GS
3D Scan
ai avatar app
AI SaaS
AirBnB
Apple ML
AR
BlockChain
Charbot
CV
Dev
Ecommerce
Finance
Image to website creation
IOS AR
Langchain
LLM
open source ios apps
Outfitanyone
Retail Product Prediction
Smartglasses
Social Media
Talking Phot
Text to 3D
Text to Speech Avatar
Translation
VFx
Voice cloning
Zoom clone
Starred repositories
Janus-Series: Unified Multimodal Understanding and Generation Models
Web App that allows you to build and test clients for firmware
Tool that generates AI startup investment memorandums
Perplexity style AI Search engine clone built with Gemini 2.0 Flash and Grounding
Stay on top of trending topics on social media and the web with AI
Official implementation of "Neuralangelo: High-Fidelity Neural Surface Reconstruction" (CVPR 2023)
[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
OpenEMMA, a permissively licensed open source "reproduction" of Waymo’s EMMA model.
A generative world for general-purpose robotics & embodied AI learning.
A react-based starter app for using the Multimodal Live API over websockets with Gemini
Composable building blocks to build Llama Apps
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
The fast, Pythonic way to build Model Context Protocol servers 🚀
Simple, unified interface to multiple Generative AI providers
first base model for full-duplex conversational audio
[NeurIPS 2024] Generalizable and Animatable Gaussian Head Avatar
✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL
Jobs_Applier_AI_Agent_AIHawk aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in a tailored way.
A simple example implementation of the VoiceRAG pattern to power interactive voice generative AI experiences using RAG with Azure AI Search and Azure OpenAI's gpt-4o-realtime-preview model.
o1-engineer is a command-line tool designed to assist developers in managing and interacting with their projects efficiently. Leveraging the power of OpenAI's API, this tool provides functionalitie…
Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"
This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
Generate a video script, voice and a talking face completely with AI