- Philadelphia
- https://jinhaoduan.github.io
Stars
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
A bibliography and survey of the papers surrounding o1
A suite of image and video neural tokenizers
Code for "Diffusion Model Alignment Using Direct Preference Optimization"
[ACL 2024] Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models
https://arxiv.org/abs/2408.02032
The official implementation of Self-Play Preference Optimization (SPPO)
The official implementation of Self-Play Fine-Tuning (SPIN)
Robust recipes to align language models with human and AI preferences
OmniGibson: a platform for accelerating Embodied AI research built upon NVIDIA's Omniverse engine. Join our Discord for support: https://discord.gg/bccR5vGFEx
The repository for the ACL 2024 paper "When to Trust LLMs: Aligning Confidence with Response Quality"
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
RAID is the largest and most challenging benchmark for machine-generated text detectors. (ACL 2024)
[ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"
🚀 Easy, open-source LLM finetuning with one-line commands, seamless cloud integration, and popular optimization frameworks. ✨
Diffusion Models in Medical Imaging (Published in Medical Image Analysis Journal)
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
A method of ensemble learning for heterogeneous large language models.
Implementation of "Decoding-time Realignment of Language Models", ICML 2024.
A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems