Ansible is a radically simple IT automation platform that makes your applications and systems easier to deploy and maintain. Automate everything from code deployment to network configuration to clo…

Python 63,370 23,933 Updated Dec 10, 2024

skills / github-pages

Create a site or blog from your GitHub repositories with GitHub Pages.

1,385 791 Updated Feb 26, 2024

QwenLM / Qwen2-VL

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 3,562 222 Updated Dec 4, 2024

chongzhou96 / EdgeSAM

Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"

Jupyter Notebook 949 42 Updated Aug 12, 2024

IDEA-Research / Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 15,377 1,413 Updated Sep 5, 2024

allenai / open-instruct

Python 2,118 237 Updated Dec 8, 2024

microsoft / BitNet

Official inference framework for 1-bit LLMs

C++ 12,318 859 Updated Nov 11, 2024

gitroomhq / postiz-app

📨 The ultimate social media scheduling tool, with a bunch of AI 🤖

TypeScript 14,199 2,437 Updated Dec 12, 2024

apple / ml-slowfast-llava

SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models

Python 185 12 Updated Sep 16, 2024

gokayfem / awesome-vlm-architectures

Famous Vision Language Models and Their Architectures

Markdown 493 25 Updated Sep 8, 2024

SHI-Labs / VCoder

VCoder: Versatile Vision Encoders for Multimodal Large Language Models, arXiv 2023 / CVPR 2024

Python 266 15 Updated Apr 17, 2024

ItzCrazyKns / Perplexica

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

TypeScript 17,356 1,613 Updated Dec 5, 2024

zeroth-robotics / zeroth-bot

3D-printed open-source humanoid robot platform for sim-to-real and RL

Rust 377 56 Updated Dec 4, 2024

ultralytics / yolo-ios-app

Ultralytics YOLO iOS App source code for running YOLOv8 in your own iOS apps 🌟

Swift 175 34 Updated Dec 12, 2024

RupertLuo / Valley

The official repository of "Video assistant towards large language model makes everything easy"

Python 210 14 Updated Feb 22, 2024

Huanshere / VideoLingo

Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音，一键全自动视频搬运AI字幕组

Python 8,320 800 Updated Dec 13, 2024

khoj-ai / khoj

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous …

Python 17,168 833 Updated Dec 13, 2024

LLaVA-VL / LLaVA-Plus-Codebase

LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills

Python 716 53 Updated Feb 1, 2024

usefulsensors / moonshine

Fast and accurate automatic speech recognition (ASR) for edge devices

Python 2,315 109 Updated Dec 9, 2024

Ucas-HaoranWei / GOT-OCR2.0

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 6,304 547 Updated Dec 8, 2024

jy0205 / Pyramid-Flow

Code of Pyramidal Flow Matching for Efficient Video Generative Modeling

Python 2,575 253 Updated Dec 8, 2024

Alpha-VLLM / LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Python 2,731 176 Updated May 24, 2024

datawhalechina / leedl-tutorial

《李宏毅深度学习教程》（李宏毅老师推荐👍，苹果书🍎），PDF下载地址：https://github.com/datawhalechina/leedl-tutorial/releases

Jupyter Notebook 14,024 2,927 Updated Dec 13, 2024

HeyPuter / puter

🌐 The Internet OS! Free, Open-Source, and Self-Hostable.

JavaScript 26,937 1,908 Updated Dec 13, 2024

Zeyi-Lin / HivisionIDPhotos

⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。

Python 13,464 1,408 Updated Nov 20, 2024

linghai06

Lists (32)

3D

anime

app

app dev

ASR

audio_processing

automl

coding and algorithms learning

depth_esitmation

frame interpolation

GAN

human_3d_mesh

image synthesis

impainting

Investment

Machine learning data

Machine learning facility

matting

NLP

ocr

RL

road features detection

scene_reconstruction

segmentaion

Self-driving

simulation

SLAM

SOD

talking_head

tracking and detection

transformer

web

Stars