Stars
Flexible and powerful framework for managing multiple AI agents and handling complex conversations
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
Fast inference from large language models via speculative decoding
A curated list for Efficient Large Language Models
NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.
Academic Homepage Template
[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
FlashInfer: Kernel Library for LLM Serving
Retrieval-Augmented Generation in 3 Lines of Code!
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation (EMNLP2024)
Discovering Bias in Latent Space: An Unsupervised Debiasing Approach (ICML 2024)
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
Ongoing research training transformer models at scale
PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design (KDD 2025)
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Structured state space sequence models
LogAI - An open-source library for log analytics and intelligence
CaMML: Context-Aware MultiModal Learner for Large Models (ACL 2024)
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
Finetune mistral-7b-instruct for sentence embeddings
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…
CoMM: Collaborative Multi-Agent, Multi-Reasoning-Path Prompting for Complex Problem Solving (NAACL 2024 Findings)
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
Generative Representational Instruction Tuning
Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically similar, but not exactly identical).