Skip to content
View shjwudp's full-sized avatar

Organizations

@BaguaSys

Block or report shjwudp

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. c4-dataset-script c4-dataset-script Public

    Inspired by google c4, here is a series of colossal clean data cleaning scripts focused on CommonCrawl data processing. Including Chinese data processing and cleaning methods in MassiveText.

    Python 120 14

  2. megabyte megabyte Public

    A PyTorch implementation of MEGABYTE. This multi-scale transformer architecture has the excellent features of tokenization-free and sub-quadratic attention. The paper link: https://arxiv.org/abs/23…

    Python 4 3

  3. BaguaSys/bagua BaguaSys/bagua Public

    Bagua Speeds up PyTorch

    Python 876 83

  4. BaguaSys/bagua-net BaguaSys/bagua-net Public archive

    High performance NCCL plugin for Bagua.

    Rust 15 4

  5. shu shu Public

    中文书籍收录整理, Collection of Chinese Books

    Python 173 32

  6. blueprint-trainer blueprint-trainer Public

    Scaffolding for sequence model training research.

    Python 1