Stars
A latent text-to-image diffusion model
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Python Data Science Handbook: full text in Jupyter Notebooks
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
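pytorch handbook is an open-source book that aims to help people who want to use PyTorch for deep learning development and research get started quickly; all of the PyTorch tutorials it contains have been tested to ensure they run successfully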
High-Resolution Image Synthesis with Latent Diffusion Models
LAVIS - A One-stop Library for Language-Vision Intelligence
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
A series of large language models trained from scratch by developers @01-ai
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image genera…
Taming Transformers for High-Resolution Image Synthesis
A PyTorch re-implementation of the official EfficientDet with SOTA real-time performance and pretrained weights.
An introductory tutorial on recommender systems; read online at: https://datawhalechina.github.io/fun-rec/
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
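[🔞🔞🔞 Contains images unsuitable for minors] AI explorations and summaries built around my strengths in programming, painting, and writing: StableDiffusion is a powerful image generation model that can generate new images by evolving an existing image. ChatGPT is a Transformer-based language generation model that can automatically generate a suitable article for a given topic. Github Copilot is an intelligent programming assistant that speeds up everyday programming.
A simplified implementation of Faster R-CNN that replicates the performance of the original paper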
Open-source and strong foundation image recognition models.
🎨 Semantic segmentation models, datasets and losses implemented in PyTorch.
[CVPR 2022] Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer
Official PyTorch repo for JoJoGAN: One Shot Face Stylization
Replication of simple CV Projects including attention, classification, detection, keypoint detection, etc.
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
Analyze unstructured data with Towhee, such as reverse image search, reverse video search, audio classification, question and answer systems, molecular search, etc.