Starred repositories
Official implementation of DeepLabCut: Markerless pose estimation of user-defined features with deep learning for all animals incl. humans
End-to-End Object Detection with Transformers
Rembg is a tool to remove images background
🔥 2D and 3D Face alignment library build using pytorch
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.
Pupil segmentation and gaze estimation using fully convolutional neural networks
This is the official code and data for paper "SynBlink and BlinkFormer: A Synthetic Dataset and Transformer-Based Method for Video Blink Detection", accepted by BMVC 2023.
[CVPR2023] A Hierarchical Representation Network for Accurate and Detailed Face Reconstruction from In-The-Wild Images.
Unsupervised High-Resolution Portrait Gaze Correction and Animation (TIP 2022)
Bringing Characters to Life with Computer Brains in Unity
PantoMatrix: Generating Face and Body Animation from Speech
[NeurIPS 2023] Official implementation of the paper "Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset"
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / LLaMA Factory / Ultralytics / MMEng…
T3Bench: Benchmarking Current Progress in Text-to-3D Generation
Demo Programs for the "Talking Head(?) Anime from a Single Image 3: Now the Body Too" Project
[CVPR 2023] Real-time Multi-person Eyeblink Detection in the Wild for Untrimmed Video
This is the official release for paper "Real-Time Gaze Tracking with Event-Driven Eye Segmentation"
Code for the paper "LightTS: Lightweight Time Series Classification with Adaptive Ensemble"
The GitHub repository for the paper "Informer" accepted by AAAI 2021.
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Blender Data generation Head Pose and Facial depth