Skip to content
View arielsho's full-sized avatar
🦾
🦾

Highlights

  • Pro

Block or report arielsho

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 4,572 448 Updated Feb 11, 2025

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 2,561 194 Updated Feb 7, 2025

Fully open reproduction of DeepSeek-R1

Python 19,135 1,618 Updated Feb 11, 2025

Open Source framework for voice and multimodal conversational AI

Python 4,673 521 Updated Feb 12, 2025

DSPy: The framework for programming—not prompting—language models

Python 21,792 1,647 Updated Feb 10, 2025

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 22,320 2,223 Updated Feb 12, 2025

Chat Templates for 🤗 HuggingFace Large Language Models

Jinja 604 55 Updated Dec 13, 2024

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 39,658 5,264 Updated Feb 11, 2025

Ongoing research training transformer models at scale

Python 11,324 2,541 Updated Feb 11, 2025
Python 679 73 Updated Feb 4, 2025

A list of AI autonomous agents

14,266 1,052 Updated Jan 8, 2025

Build Text Rerankers with Deep Language Models

Python 255 24 Updated Feb 20, 2024

Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.

Python 5,091 426 Updated Jan 21, 2025

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

Python 4,406 433 Updated Feb 11, 2025

BLEURT is a metric for Natural Language Generation based on transfer learning.

Python 714 85 Updated Aug 4, 2023

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 46,144 4,905 Updated Jan 22, 2025

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 16,180 2,329 Updated Feb 10, 2025

Set of tools to assess and improve LLM security.

Python 2,888 479 Updated Jan 29, 2025

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Python 2,351 148 Updated Feb 11, 2025

Go ahead and axolotl questions

Python 8,558 945 Updated Feb 12, 2025

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,231 401 Updated Nov 18, 2024

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,232 830 Updated Jun 10, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 37,435 5,630 Updated Feb 12, 2025

Grok open release

Python 49,903 8,336 Updated Aug 30, 2024

LLM plugin for clustering embeddings

Python 68 5 Updated Mar 1, 2024

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 14+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

Python 7,161 559 Updated Feb 12, 2025

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Python 1,695 199 Updated Feb 4, 2025

Unified framework for building enterprise RAG pipelines with small, specialized models

Python 8,933 1,547 Updated Jan 25, 2025
Next