Skip to content
View yaodongyu's full-sized avatar

Block or report yaodongyu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 2,999 285 Updated Dec 13, 2024

Repo for the research paper "Aligning LLMs to Be Robust Against Prompt Injection"

Python 23 2 Updated Dec 9, 2024

Code implementation of synthetic continued pretraining

Python 66 4 Updated Oct 6, 2024

Forecasting with LLMs

HTML 25 10 Updated Jun 17, 2024

xLSTM as Generic Vision Backbone

Python 445 31 Updated Nov 4, 2024

Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 811 33 Updated Dec 4, 2024

The official implementation of DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis

Python 164 9 Updated Jun 28, 2024

Efficient Triton Kernels for LLM Training

Python 3,822 229 Updated Dec 13, 2024

Official repo for consistency models.

Python 6,198 425 Updated Mar 22, 2024

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 10,905 1,084 Updated Dec 9, 2024

[ICML 2024] CLLMs: Consistency Large Language Models

Python 360 18 Updated Nov 16, 2024

Official PyTorch Implementation of the Longhorn Deep State Space Model

Python 42 3 Updated Dec 4, 2024

Official Implementation of Rectified Flow (ICLR2023 Spotlight)

Python 1,004 60 Updated Jul 20, 2024

Codebase for the ICML 2024 paper "Differentially Private Representation Learning via Image Captioning"

Python 9 Updated Jul 22, 2024

Universal and Transferable Attacks on Aligned Language Models

Python 3,505 484 Updated Aug 2, 2024

Code for the paper "Interpreting and Improving Diffusion Models from an Optimization Perspective", appearing in ICML 2024

Python 5 Updated Sep 30, 2024

The official implementation for Pseudo Numerical Methods for Diffusion Models on Manifolds (PNDM, PLMS | ICLR2022)

Python 333 31 Updated Apr 25, 2023

Code for CRATE (Coding RAte reduction TransformEr).

Python 1,184 97 Updated Oct 23, 2024

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 1,121 63 Updated Sep 27, 2024

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Python 1,161 68 Updated Oct 14, 2024

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 1,713 105 Updated Jun 1, 2023

Fast, memory-efficient, scalable optimization of deep learning with differential privacy

Python 105 19 Updated Nov 19, 2024

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 8,544 1,054 Updated Oct 9, 2024

Fully featured implementation of Routing Transformer

Python 287 30 Updated Nov 6, 2021

Sinkhorn Transformer - Practical implementation of Sparse Sinkhorn Attention

Python 255 21 Updated Aug 10, 2021

Transformer based on a variant of attention that is linear complexity in respect to sequence length

Python 707 67 Updated May 5, 2024

[ICML 2021 Oral] We show pure attention suffers rank collapse, and how different mechanisms combat it.

Jupyter Notebook 163 10 Updated Mar 8, 2021

An implementation of Performer, a linear attention-based transformer, in Pytorch

Python 1,101 144 Updated Feb 2, 2022

Reformer, the efficient Transformer, in Pytorch

Python 2,132 256 Updated Jun 21, 2023

A concise but complete full-attention transformer with a set of promising experimental features from various papers

Python 4,891 424 Updated Dec 10, 2024
Next