Stars
Small python-gtk application, which helps the user to merge or split PDF documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface.
PDFsam, a desktop application to split, merge, mix, rotate PDF files and extract pages
🔥🔥First-ever hour-scale video understanding models
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Efficient Inference of Transformer models
Fast and accurate automatic speech recognition (ASR) for edge devices
Leading free and open-source face recognition system
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
High-resolution models for human tasks.
Minimal code and examples for inferencing Sapiens foundation human models in PyTorch
[ECCV 2024 Oral🔥] Arc2Face: A Foundation Model for ID-Consistent Human Faces
ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation
Tools for merging pretrained large language models.
Example models using DeepSpeed
Implementations of different ML tasks on the Kaggle platform with GPUs.
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
Development repository for the Triton language and compiler
Official implementation for "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)
Machine Learning for Imbalanced Data, published by Packt
Official repository of Slide-Transformer (CVPR2023)
A simple deep learning framework with a dynamic computational graph, based purely on NumPy