Lists (32)
Sort Name ascending (A-Z)
3D
anime
app
app dev
ASR
audio_processing
automl
coding and algorithms learning
depth_esitmation
frame interpolation
GAN
human_3d_mesh
image synthesis
impainting
Investment
Machine learning data
Machine learning facility
matting
NLP
ocr
RL
road features detection
scene_reconstruction
segmentaion
Self-driving
simulation
SLAM
SOD
talking_head
tracking and detection
transformer
web
Stars
- All languages
- AppleScript
- Batchfile
- C
- C#
- C++
- CMake
- CSS
- Clojure
- Cuda
- Dart
- Dockerfile
- Erlang
- FreeMarker
- Go
- HTML
- Java
- JavaScript
- Jinja
- Jupyter Notebook
- Kotlin
- Less
- Lua
- MATLAB
- MDX
- Makefile
- Markdown
- Mojo
- Nunjucks
- Objective-C
- OpenEdge ABL
- PHP
- PLpgSQL
- Pascal
- Python
- R
- Reason
- Rich Text Format
- Ruby
- Rust
- Scala
- Shell
- Svelte
- Swift
- SystemVerilog
- TeX
- TypeScript
- Verilog
- Vim Script
- Vue
🔥🔥First-ever hour scale video understanding models
Images to inference with no labeling (use foundation models to train supervised models).
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
Anki's shared backend and web components, and the Qt frontend
Ansible is a radically simple IT automation platform that makes your applications and systems easier to deploy and maintain. Automate everything from code deployment to network configuration to clo…
Create a site or blog from your GitHub repositories with GitHub Pages.
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
📨 The ultimate social media scheduling tool, with a bunch of AI 🤖
SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models
Famous Vision Language Models and Their Architectures
VCoder: Versatile Vision Encoders for Multimodal Large Language Models, arXiv 2023 / CVPR 2024
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
3D-printed open-source humanoid robot platform for sim-to-real and RL
Ultralytics YOLO iOS App source code for running YOLOv8 in your own iOS apps 🌟
The official repository of "Video assistant towards large language model makes everything easy"
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous …
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
Fast and accurate automatic speech recognition (ASR) for edge devices
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
An Open-source Toolkit for LLM Development
《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
🌐 The Internet OS! Free, Open-Source, and Self-Hostable.
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。