Highlights
- Pro
Lists (7)
Sort Name ascending (A-Z)
Starred repositories
A web-based collaborative LaTeX editor
Object detection, 3D detection, and pose estimation using center point detection:
Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models'.
A collection list of AIGC detection related papers.
[KDD 2023] What’s Behind the Mask: Understanding Masked Graph Modeling for Graph Autoencoders
[ICLR24] Official PyTorch Implementation of Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors
offical code for MMANet: Margin-aware Distillation and Modality-aware Regularization for Incomplete Multimodal Learning
The official implementation of CMAE https://arxiv.org/abs/2207.13532 and https://ieeexplore.ieee.org/document/10330745
Copy LaTeX Equations as Word Equations, a Chrome Extension
DILAB-HYU / MOCE
Forked from gyeomo/MOCEModel-Oriented Concepts for Explaining Deep Neural Networks
Official implementation of Learning Point-guided Localization for Detection in Remote Sensing Images
[T-PAMI] A curated list of self-supervised multimodal learning resources.
使用Github Action将国外的Docker镜像转存到阿里云私有仓库,供国内服务器使用,免费易用
Vincentqyw / RoMa
Forked from Parskatt/RoMa[CVPR 2024] RoMa: Robust Dense Feature Matching; RoMa is the robust dense feature matcher capable of estimating pixel-dense warps and reliable certainties for almost any image pair.
[CVPR 2024] RoMa: Robust Dense Feature Matching; RoMa is the robust dense feature matcher capable of estimating pixel-dense warps and reliable certainties for almost any image pair.
Understand Human Behavior to Align True Needs
The code of the paper "Multi-view contrastive clustering via integrating graph aggregation and confidence enhancement"
⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~
Multimodal Masked Autoencoders (M3AE): A JAX/Flax Implementation
A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He el al.).
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
Code for the paper "Image Clustering with External Guidance" (ICML 2024)
The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL)
The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024
竞争性自适应重加权采样法(competitive adapative reweighted sampling, CARS)python代码
😎 Everything about class-imbalanced/long-tail learning: papers, codes, frameworks, and libraries | 有关类别不平衡/长尾学习的一切:论文、代码、框架与库