Skip to content
View baoleai's full-sized avatar
🤗
🤗

Block or report baoleai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without lossing end-to-end metrics across various models.

    Cuda Apache License 2.0 Updated Feb 28, 2025
  • diffusers Public

    Forked from huggingface/diffusers

    🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

    Python Apache License 2.0 Updated Nov 20, 2024
  • swift Public

    Forked from modelscope/ms-swift

    魔搭大模型训练推理工具箱,支持LLaMA、千问、ChatGLM、BaiChuan等多种模型及LoRA等多种训练方式(The LLM training/inference framework of ModelScope community, Support various models like LLaMA, Qwen, ChatGLM, Baichuan and others, and tr…

    Python Apache License 2.0 Updated Nov 5, 2024
  • pytorch Public

    Forked from pytorch/pytorch

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

    Python Other Updated Aug 1, 2024
  • xla Public

    Forked from pytorch/xla

    Enabling PyTorch on XLA Devices (e.g. Google TPU)

    C++ Other Updated Aug 1, 2024
  • Fast and easy distributed model training examples.

    Python Apache License 2.0 Updated Jul 3, 2024
  • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

    Python Apache License 2.0 Updated Jun 6, 2024
  • DiT Public

    Forked from facebookresearch/DiT

    Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

    Python Other Updated Mar 4, 2024
  • openxla-xla Public

    Forked from openxla/xla

    A machine learning compiler for GPUs, CPUs, and ML accelerators

    C++ Apache License 2.0 Updated Nov 23, 2023
  • A graph learning library for PyTorch that makes distributed GNN training and inference easy and efficient.

    Python 1 Apache License 2.0 Updated May 24, 2023
  • cub Public

    Forked from NVIDIA/cub

    Cooperative primitives for CUDA C++.

    Cuda BSD 3-Clause "New" or "Revised" License Updated Feb 28, 2023
  • Graph Neural Network Library for PyTorch

    Python MIT License Updated Nov 30, 2022
  • graph-learn Public

    Forked from alibaba/graph-learn

    graph-learn

    C++ Apache License 2.0 Updated Jul 13, 2022