lvjing2

Follow

leo james lvjing2

Follow

serverless, 微服务，devops，云原生，weChat: zzl_ing

17 followers · 26 following

alipay
Shanghai city

Achievements

Achievements

Lists (2)

Sort

AI

✨ Inspiration

Stars

hypertrons / hypertrons-crx

A browser extension for insights into GitHub, Gitee projects and developers.

TypeScript 362 102 Updated Jan 19, 2025

substratusai / kubeai

AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text.

Go 676 54 Updated Feb 3, 2025

lvjing2 / my-transformer

从 0 手撸 transformer，适合 java 开发者的版本。

Python 2 Updated Oct 28, 2024

bytedance / monolith

A Lightweight Recommendation System

Python 8,499 654 Updated Nov 8, 2023

sofastack / sofa-hessian

An internal improved version of Hessian3/4 powered by Ant Group CO., Ltd.

Java 143 55 Updated Sep 18, 2024

huggingface / text-generation-inference

Large Language Model Text Generation Inference

Python 9,685 1,135 Updated Feb 2, 2025

wangshuai09 / vllm

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 31 6 Updated Jan 13, 2025

pytorch / pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 86,438 23,269 Updated Feb 3, 2025

bojone / papers.cool

Cool Papers - Immersive Paper Discovery

JavaScript 457 7 Updated Jan 15, 2025

colinwilson / lotusdocs

📖 A free, lightweight, modern documentation theme for Hugo

JavaScript 404 90 Updated Jan 29, 2025

state-spaces / mamba

Mamba SSM architecture

Python 13,876 1,193 Updated Jan 18, 2025

ForceInjection / AI-fundermentals

AI 基础知识 - GPU 架构、CUDA 编程以及大模型基础知识

Shell 67 5 Updated Jan 31, 2025

moovweb / gvm

Go Version Manager

Shell 10,541 557 Updated Aug 8, 2024

microsoft / vattention

Dynamic Memory Management for Serving LLMs without PagedAttention

C 275 22 Updated Feb 2, 2025

utkuozdemir / nvidia_gpu_exporter

Nvidia GPU exporter for prometheus using nvidia-smi binary

Go 981 113 Updated Jan 24, 2025

QwenLM / Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 16,378 1,347 Updated Feb 1, 2025

S95Sedan / Deepspeed-Windows

Deepspeed windows information

C++ 36 2 Updated Mar 9, 2024

fengxxc / wechatmp2markdown

微信公众号文章转Markdown

Go 104 21 Updated Sep 26, 2024

koupleless / docs

Koupleless official documentation.

JavaScript 6 10 Updated Jan 22, 2025

koupleless / virtual-kubelet

Go 2 3 Updated Jan 14, 2025

prometheus / prometheus

The Prometheus monitoring system and time series database.

Go 57,058 9,328 Updated Feb 3, 2025

grafana / grafana

The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Prometheus, Loki, Elasticsearch, InfluxDB, Postgres and many mo…

TypeScript 66,255 12,337 Updated Feb 3, 2025

eclipse-jgit / jgit

JGit, the Java implementation of git

Java 193 55 Updated Feb 2, 2025

donnemartin / system-design-primer

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 287,889 47,963 Updated Dec 2, 2024

QwenLM / Qwen2.5

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 14,577 972 Updated Jan 23, 2025

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 11,235 2,513 Updated Feb 2, 2025

triton-lang / triton

Development repository for the Triton language and compiler

C++ 14,242 1,763 Updated Feb 3, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 35,999 5,449 Updated Feb 3, 2025

git / git

Git Source Code Mirror - This is a publish-only repository but pull requests can be turned into patches to the mailing list via GitGitGadget (https://gitgitgadget.github.io/). Please follow Documen…

C 53,399 25,877 Updated Feb 1, 2025

google / vizier

Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.

Python 1,530 99 Updated Feb 1, 2025