
Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]

Python 52 6 Updated Nov 14, 2024
Python 44 2 Updated Jan 24, 2024

[NeurIPS 2024 Main Track] Code for the paper titled "Instruction Tuning With Loss Over Instructions"

Python 29 7 Updated May 24, 2024

This repository is the official implementation of DoFIT (NeurIPS 2024).

Python 3 Updated Nov 24, 2024

Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.

346 11 Updated Apr 18, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 2,999 285 Updated Dec 13, 2024

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 22,645 5,540 Updated Aug 14, 2024

I built a GPT-2-style tokenizer that can be trained on any .txt data to generate tokens

Python 1 Updated Oct 12, 2024

Chinese tokens in tiktoken tokenizers.

30 Updated May 15, 2024

TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and without the use of translation, and is designed for the trai…

Python 295 43 Updated May 28, 2020

This is an NVIDIA AI Workbench example project that demonstrates an end-to-end model development workflow using Llamafactory.

Jupyter Notebook 39 15 Updated Oct 17, 2024

[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning

Jupyter Notebook 386 37 Updated Oct 20, 2024

Generates and optimizes Haiku system and user prompts for classification

HTML 10 1 Updated Nov 2, 2024

This repo includes ChatGPT prompt curation to use ChatGPT better.

HTML 113,979 15,561 Updated Nov 11, 2024
Jupyter Notebook 9,408 645 Updated Jul 29, 2024

A method for calculating scaling laws for LLMs from publicly available models

Python 8 Updated Apr 22, 2024

minimal LLM scripts for 24GB VRAM GPUs. training, inference, whatever

Jupyter Notebook 35 2 Updated Dec 3, 2024

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,191 461 Updated Nov 6, 2024

The Open Cookbook for Top-Tier Code Large Language Model

Python 1,430 85 Updated Dec 8, 2024

Repair invalid JSON documents

TypeScript 582 37 Updated Dec 3, 2024

Reading list on instruction tuning, a trend that started with Natural-Instruction (ACL 2022), FLAN (ICLR 2022), and T0 (ICLR 2022).

759 24 Updated Jul 20, 2023

DSPy: The framework for programming—not prompting—language models

Python 20,153 1,527 Updated Dec 13, 2024

A framework for prompt tuning using Intent-based Prompt Calibration

Python 2,258 197 Updated Nov 23, 2024

[ICML 2024] TrustLLM: Trustworthiness in Large Language Models

Python 491 47 Updated Sep 29, 2024

This is the code repo for our paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards".

Python 18 1 Updated Dec 2, 2024

A high-performance deep learning training platform with task-level time-sharing scheduling of GPU compute

Python 318 40 Updated Oct 24, 2023

Evaluating the Ripple Effects of Knowledge Editing in Language Models

Python 53 4 Updated Apr 15, 2024

Demystifying Verbatim Memorization in Large Language Models

Python 3 1 Updated Aug 16, 2024