Skip to content
View lvjing2's full-sized avatar
  • alipay
  • Shanghai city

Block or report lvjing2

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A browser extension for insights into GitHub, Gitee projects and developers.

TypeScript 362 102 Updated Jan 19, 2025

AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text.

Go 676 54 Updated Feb 3, 2025

从 0 手撸 transformer,适合 java 开发者的版本。

Python 2 Updated Oct 28, 2024

A Lightweight Recommendation System

Python 8,499 654 Updated Nov 8, 2023

An internal improved version of Hessian3/4 powered by Ant Group CO., Ltd.

Java 143 55 Updated Sep 18, 2024

Large Language Model Text Generation Inference

Python 9,685 1,135 Updated Feb 2, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 31 6 Updated Jan 13, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 86,438 23,269 Updated Feb 3, 2025

Cool Papers - Immersive Paper Discovery

JavaScript 457 7 Updated Jan 15, 2025

📖 A free, lightweight, modern documentation theme for Hugo

JavaScript 404 90 Updated Jan 29, 2025

Mamba SSM architecture

Python 13,876 1,193 Updated Jan 18, 2025

AI 基础知识 - GPU 架构、CUDA 编程以及大模型基础知识

Shell 67 5 Updated Jan 31, 2025

Go Version Manager

Shell 10,541 557 Updated Aug 8, 2024

Dynamic Memory Management for Serving LLMs without PagedAttention

C 275 22 Updated Feb 2, 2025

Nvidia GPU exporter for prometheus using nvidia-smi binary

Go 981 113 Updated Jan 24, 2025

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 16,378 1,347 Updated Feb 1, 2025

Deepspeed windows information

C++ 36 2 Updated Mar 9, 2024

微信公众号文章转Markdown

Go 104 21 Updated Sep 26, 2024

Koupleless official documentation.

JavaScript 6 10 Updated Jan 22, 2025

The Prometheus monitoring system and time series database.

Go 57,058 9,328 Updated Feb 3, 2025

The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Prometheus, Loki, Elasticsearch, InfluxDB, Postgres and many mo…

TypeScript 66,255 12,337 Updated Feb 3, 2025

JGit, the Java implementation of git

Java 193 55 Updated Feb 2, 2025

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 287,889 47,963 Updated Dec 2, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 14,577 972 Updated Jan 23, 2025

Ongoing research training transformer models at scale

Python 11,235 2,513 Updated Feb 2, 2025

Development repository for the Triton language and compiler

C++ 14,242 1,763 Updated Feb 3, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 35,999 5,449 Updated Feb 3, 2025

Git Source Code Mirror - This is a publish-only repository but pull requests can be turned into patches to the mailing list via GitGitGadget (https://gitgitgadget.github.io/). Please follow Documen…

C 53,399 25,877 Updated Feb 1, 2025

Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.

Python 1,530 99 Updated Feb 1, 2025
Next