Starred repositories
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Official PyTorch implementation of "TinySAM: Pushing the Envelope for Efficient Segment Anything Model"
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
An offical implementation of PatchTST: "A Time Series is Worth 64 Words: Long-term Forecasting with Transformers." (ICLR 2023) https://arxiv.org/abs/2211.14730
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
OpenMMLab Pose Estimation Toolbox and Benchmark.
⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~
A playbook for systematically maximizing the performance of deep learning models.
[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box
Master the command line, in one page
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction of AlphaFold 2
[AAAI 2020] Official implementation of VAANet for Emotion Recognition
The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"
This repo contains the code for generating the ToxiGen dataset, published at ACL 2022.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Code for the paper: Detecting Photoshopped Faces by Scripting Photoshop
[CVPR 2022] MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation
A toolbox for skeleton-based action recognition.
This is the complementary repository of the paper "Reviving a failed network through microscopic interventions"
FinRL: Financial Reinforcement Learning. 🔥
TODS: An Automated Time-series Outlier Detection System
AutoVideo: An Automated Video Action Recognition System