Stars
Platform to experiment with the AI Software Engineer. Terminal-based. NOTE: Very different from https://gptengineer.app
Up-to-date (2023) collection of technical interview questions from Alibaba, Tencent, Baidu, Meituan, ByteDance, and other companies, with answers and analysis from expert interviewers.
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
A toolkit for developing and comparing reinforcement learning algorithms.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
A high-throughput and memory-efficient inference and serving engine for LLMs
Open-Sora: Democratizing Efficient Video Production for All
Chinese translation of "Designing Data-Intensive Applications" (DDIA)
Fast and memory-efficient exact attention
Ongoing research training transformer models at scale
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
SGLang is a fast serving framework for large language models and vision language models.
Open-source observability for your LLM application, based on OpenTelemetry
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
A book about Pythonic application architecture patterns for managing complexity. Cosmos is the opposite of chaos, you see. O'Reilly wouldn't actually let us call it "Cosmic Python", though.
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
A best practices guide for day 2 operations, including operational excellence, security, reliability, performance efficiency, and cost optimization.
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
Just Code! Algorithm problems for interview practice, currently covering ByteDance interview questions, LeetCode, and 剑指 Offer, with more continuously being added ⭐
Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji.
Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.
Powering AWS purpose-built machine learning chips. Blazing fast and cost-effective, natively integrated into PyTorch and TensorFlow and integrated with your favorite AWS services
AIFoundation covers what happens when AI systems meet large models: the full-stack core technologies for system-level support of large-model training and inference, from the bottom layer up.