
Python 5 Updated Oct 18, 2023

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)

Python 2,778 260 Updated Nov 25, 2024

A list of all public EEG-datasets

2,285 529 Updated Aug 5, 2024

A bibliography and survey of the papers surrounding o1

TeX 820 37 Updated Nov 16, 2024

A suite of image and video neural tokenizers

Python 888 20 Updated Nov 13, 2024

Code for "Diffusion Model Alignment Using Direct Preference Optimization"

Python 271 23 Updated Dec 28, 2023

Biomedical Question Answering Datasets.

83 5 Updated Jul 11, 2023

[ACL 2024] Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models

Python 38 3 Updated Sep 4, 2024

https://arxiv.org/abs/2408.02032

Python 69 5 Updated Oct 27, 2024

The official implementation of Self-Play Preference Optimization (SPPO)

Python 500 62 Updated Nov 23, 2024

The official implementation of Self-Play Fine-Tuning (SPIN)

Python 1,051 92 Updated May 8, 2024

Robust recipes to align language models with human and AI preferences

Python 4,727 414 Updated Nov 21, 2024

OmniGibson: a platform for accelerating Embodied AI research built upon NVIDIA's Omniverse engine. Join our Discord for support: https://discord.gg/bccR5vGFEx

Python 533 55 Updated Nov 27, 2024

The repository for ACL 2024 paper "When to Trust LLMs: Aligning Confidence with Response Quality"

Python 3 Updated Aug 12, 2024
Python 10 Updated Feb 29, 2024

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,041 44 Updated Nov 19, 2024

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Python 738 47 Updated Oct 24, 2024

RAID is the largest and most challenging benchmark for machine-generated text detectors. (ACL 2024)

Python 37 14 Updated Nov 8, 2024

[ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"

Python 687 69 Updated Jul 30, 2024
Python 186 27 Updated Nov 25, 2024

🚀 Easy, open-source LLM finetuning with one-line commands, seamless cloud integration, and popular optimization frameworks. ✨

Python 85 3 Updated Aug 14, 2024

Diffusion Models in Medical Imaging (Published in Medical Image Analysis Journal)

1,553 148 Updated Oct 15, 2024

[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.

Jupyter Notebook 1,953 235 Updated Nov 26, 2024

DataComp for Language Models

HTML 1,163 108 Updated Nov 26, 2024

A method of ensemble learning for heterogeneous large language models.

Python 33 3 Updated Aug 7, 2024

Implementation of "Decoding-time Realignment of Language Models", ICML 2024.

Jupyter Notebook 16 1 Updated Jun 17, 2024
Python 20 Updated Jun 24, 2024

A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems

203 12 Updated Sep 19, 2024