Skip to content
View fataoup's full-sized avatar
  • Hefei,Anhui,China
  • 17:02 (UTC +08:00)

Highlights

  • Pro

Block or report fataoup

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An open-source implementaion for fine-tuning Qwen2-VL series by Alibaba Cloud.

Python 146 16 Updated Dec 12, 2024

SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer

Python 169 4 Updated Dec 5, 2024

Visualizing the attention of vision-language models

Jupyter Notebook 87 6 Updated Oct 26, 2024

SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models

Python 185 12 Updated Sep 16, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 3,577 224 Updated Dec 4, 2024

Utilities intended for use with Llama models.

Python 5,300 884 Updated Dec 10, 2024

《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀

Shell 52,622 11,620 Updated Dec 12, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 6,376 492 Updated Dec 10, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 2,246 186 Updated Aug 11, 2024

SEED-Voken: A Series of Powerful Visual Tokenizers

Python 771 30 Updated Dec 4, 2024

A Python toolbox for performing gradient-free optimization

Python 3,980 356 Updated Dec 5, 2024

[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

Python 604 36 Updated Jul 22, 2024

[ECCV 2024🔥] Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"

Python 127 4 Updated Sep 10, 2024

MambaOut: Do We Really Need Mamba for Vision?

Python 2,082 35 Updated Oct 22, 2024

[COLING 2024] Official code for paper "Few-shot Temporal Pruning Accelerates Diffusion Models for Text Generation".

Python 3 Updated Jul 27, 2024

The official Meta Llama 3 GitHub site

Python 27,493 3,132 Updated Aug 12, 2024

Official release of InternLM2.5 base and chat models. 1M context support

Python 6,569 461 Updated Nov 21, 2024

[NAACL 2024] LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-text Generation?

Python 37 2 Updated Jun 9, 2024

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…

Python 6,163 418 Updated Dec 6, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 22,607 2,218 Updated Nov 28, 2024

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 2,720 255 Updated Aug 9, 2024

Painter & SegGPT Series: Vision Foundation Models from BAAI

Python 2,536 176 Updated Dec 6, 2024

Video datasets

1,242 95 Updated Mar 8, 2023

Fast and memory-efficient exact attention

Python 14,627 1,373 Updated Dec 13, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 6,535 585 Updated May 31, 2024

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

3,625 205 Updated Dec 13, 2024

This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.

Python 1,098 53 Updated Nov 22, 2024

Generative Models by Stability AI

Python 24,856 2,764 Updated Sep 4, 2024

Implementation of MagViT2 Tokenizer in Pytorch

Python 576 33 Updated Oct 14, 2024
Next