Skip to content
View junjie18's full-sized avatar

Block or report junjie18

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Python tool for converting files and office documents to Markdown.

Python 34,580 1,525 Updated Jan 16, 2025

Muon optimizer for neural networks: >30% extra sample efficiency, <3% wallclock overhead

Python 211 6 Updated Jan 4, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 7,401 714 Updated Jan 19, 2025

MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts

Python 163 4 Updated Oct 16, 2024

O1 Replication Journey

1,882 58 Updated Jan 14, 2025

A reading list on LLM based Synthetic Data Generation 🔥

973 55 Updated Nov 5, 2024

Data and tools for generating and inspecting OLMo pre-training data.

Python 1,061 116 Updated Jan 15, 2025

A PyTorch Native LLM Training Framework

Python 694 36 Updated Dec 27, 2024

🙌 OpenHands: Code Less, Make More

Python 43,884 4,862 Updated Jan 19, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 13,688 1,354 Updated Dec 25, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 12,891 2,631 Updated Jan 19, 2025

The Memory layer for your AI apps

Python 24,009 2,220 Updated Jan 19, 2025

The MATH Dataset (NeurIPS 2021)

Python 985 90 Updated Aug 5, 2024

LLM101n: Let's build a Storyteller

31,044 1,698 Updated Aug 1, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,462 59 Updated Aug 15, 2024

Video+code lecture on building nanoGPT from scratch

Python 3,791 540 Updated Aug 13, 2024

Kolmogorov Arnold Networks

Jupyter Notebook 15,321 1,435 Updated Jan 12, 2025

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 5,748 482 Updated Jan 17, 2025

An efficient video loader for deep learning with smart shuffling that's super easy to digest

C++ 1,974 168 Updated Jul 17, 2024

My favorite C programming practices.

2,026 98 Updated Oct 1, 2020

Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]

Python 130 6 Updated Sep 20, 2024

🚀 KIMI AI 长文本大模型逆向API【特长:长文本解读整理】,支持高速流式输出、智能体对话、联网搜索、探索版、K1思考模型、长文档解读、图像解析、多轮对话,零配置部署,多路token支持,自动清理会话痕迹,仅供测试,如需商用请前往官方开放平台。

TypeScript 4,088 680 Updated Dec 30, 2024

Tile primitives for speedy kernels

Cuda 1,938 96 Updated Jan 19, 2025

The hub for EleutherAI's work on interpretability and learning dynamics

Jupyter Notebook 2,340 175 Updated Dec 5, 2024

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

4,037 199 Updated Sep 25, 2024
Python 274 14 Updated Jul 28, 2024

《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing

Java 107,436 13,422 Updated Jan 14, 2025

A library for advanced large language model reasoning

Python 1,666 147 Updated Jan 16, 2025

A minimal GPU design in Verilog to learn how GPUs work from the ground up

SystemVerilog 7,352 566 Updated Aug 18, 2024
Next