Stars
A collection of Mathematics + CS + what have you related books collected over the years for school 🎓 and personal reading 📚.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
This is the official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS2024)
A survey on harmful fine-tuning attack for large language model
Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep
🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous …
Reading list for adversarial perspective and robustness in deep reinforcement learning.
AutoPrompt: Automatic Prompt Construction for Masked Language Models.
A playbook for effectively prompting post-trained LLMs
A playbook for systematically maximizing the performance of deep learning models.
[WIP] Resources for AI engineers. Also contains supporting materials for the book AI Engineering (Chip Huyen, 2025)
This is the code repository for ACL2024 paper: Exploring Memorization in Fine-tuned Language Models
✨✨Latest Advances on Multimodal Large Language Models
Collection of AWESOME vision-language models for vision tasks
This is the reading list mainly on adversarial examples (attacks, defenses, etc.) I try to keep and update regularly.
AAAI 2019 oral presentation
Direct Preference Optimization from scratch in PyTorch
🏅 Collection of Kaggle Solutions and Ideas 🏅
Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations"
[ACL 2024] Prompt Optimization via Adversarial In-Context Learning
MLNLP社区用来帮助大家避免论文投稿小错误的整理仓库。 Paper Writing Tips
Aligning Large Language Models with Human: A Survey
Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR 2023
A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)
Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learning.