Articles by Zhoutong
Activity
-
We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute 🔥 How? By combining step-wise reward models with tree search…
We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute 🔥 How? By combining step-wise reward models with tree search…
Liked by Zhoutong Fu
-
We just introduced Multi-Threading (tested with #NoGIL Python) and Multi-Dataset support to #pytorch dataloading! Check out torchdata.nodes, a…
We just introduced Multi-Threading (tested with #NoGIL Python) and Multi-Dataset support to #pytorch dataloading! Check out torchdata.nodes, a…
Liked by Zhoutong Fu
-
Phi-4 is coming for Christmas! 🎄 Microsoft just announced Phi-4, a 14B LLM that outperforms GPT-4o on STEM-focused QA. Built using large-scale…
Phi-4 is coming for Christmas! 🎄 Microsoft just announced Phi-4, a 14B LLM that outperforms GPT-4o on STEM-focused QA. Built using large-scale…
Liked by Zhoutong Fu
Experience
Education
Courses
-
Applied Multivariate Analysis
STATS 206
-
Computational Finance
MATH 623
-
Computer Organization and Systems
CS 107
-
Data Mining and Analysis
STATS 202
-
Discrete Stochastic Process
STATS 526
-
From Language To Information
CS 124
-
Information Retrieval and Web Search
CS 276
-
Linear Models
STATS 305
-
Linear Programming
IOE 510
-
Machine Learning
CS 229
-
Mining Massive Data Sets
CS 246
-
Modern Algorithm Toolbox
CS 168
-
Modern Applied Statistics: Data Mining
STATS 315B
-
Modern Applied Statistics: Learning
STATS 315A
-
Object Oriented Programming Design
CS 108
-
Paradigms for Computing with Data
STATS 290
-
Probability Theory
MATH 525
-
Programming Abstraction
CS 106B
-
Statistics for Financial Data
STATS 509
-
Stochastic Processes
STATS 219
-
Time Series Analysis
STATS 207
Languages
-
English
Full professional proficiency
-
Chinese
Native or bilingual proficiency
More activity by Zhoutong
-
𝐒𝐜𝐚𝐥𝐞 𝐭𝐞𝐬𝐭-𝐭𝐢𝐦𝐞 𝐜𝐨𝐦𝐩𝐮𝐭𝐞—𝐟𝐨𝐫 𝐞𝐦𝐛𝐞𝐝𝐝𝐢𝐧𝐠 𝐦𝐨𝐝𝐞𝐥𝐬! If you could achieve 3x better zero-shot classification results…
𝐒𝐜𝐚𝐥𝐞 𝐭𝐞𝐬𝐭-𝐭𝐢𝐦𝐞 𝐜𝐨𝐦𝐩𝐮𝐭𝐞—𝐟𝐨𝐫 𝐞𝐦𝐛𝐞𝐝𝐝𝐢𝐧𝐠 𝐦𝐨𝐝𝐞𝐥𝐬! If you could achieve 3x better zero-shot classification results…
Liked by Zhoutong Fu
-
Gemini 2 Flash is now available on AI Studio and Vertex AI. It beats Gemini 1.5 Pro on all benchmarks handily while being 2X faster 🚀 🔥. But that's…
Gemini 2 Flash is now available on AI Studio and Vertex AI. It beats Gemini 1.5 Pro on all benchmarks handily while being 2X faster 🚀 🔥. But that's…
Liked by Zhoutong Fu
-
Reverse Thinking Makes LLMs Stronger Reasoners (RevThink) 🚀🚀 Humans often refine their thinking by working backward from solutions, and our…
Reverse Thinking Makes LLMs Stronger Reasoners (RevThink) 🚀🚀 Humans often refine their thinking by working backward from solutions, and our…
Liked by Zhoutong Fu
-
Introducing the first open-source optimized post-training losses in Liger Kernel with ~80% memory reduction, featuring DPO, CPO, ORPO, SimPO, JSD…
Introducing the first open-source optimized post-training losses in Liger Kernel with ~80% memory reduction, featuring DPO, CPO, ORPO, SimPO, JSD…
Liked by Zhoutong Fu
-
A Survey on LLMs-as-Judges Presents a comprehensive survey of the LLMs-as-judges paradigm from five key perspectives: Functionality, Methodology…
A Survey on LLMs-as-Judges Presents a comprehensive survey of the LLMs-as-judges paradigm from five key perspectives: Functionality, Methodology…
Liked by Zhoutong Fu
-
Huge accomplishment by the Google Quantum AI lab. The new Willow chip takes 5 minutes to run the Random Circuit Sampling (RCS) benchmark, which…
Huge accomplishment by the Google Quantum AI lab. The new Willow chip takes 5 minutes to run the Random Circuit Sampling (RCS) benchmark, which…
Liked by Zhoutong Fu
-
3x more tokens and 13x faster generations than vLLM? 👀 Hugging Face TGI 3.0 released! 🎉TGI 3.0 dramatically improves LLM inference processing by 3x…
3x more tokens and 13x faster generations than vLLM? 👀 Hugging Face TGI 3.0 released! 🎉TGI 3.0 dramatically improves LLM inference processing by 3x…
Liked by Zhoutong Fu
-
What is better than an LLM as a Judge? Right, an Agent as a Judge! Meta created an Agent-as-a-Judge to evaluate code agents to enable intermediate…
What is better than an LLM as a Judge? Right, an Agent as a Judge! Meta created an Agent-as-a-Judge to evaluate code agents to enable intermediate…
Liked by Zhoutong Fu
-
vLLM Joins PyTorch Ecosystem 🎉 Easy, Fast, and Cheap LLM Serving for Everyone vLLM has always had a strong connection with the PyTorch project. It…
vLLM Joins PyTorch Ecosystem 🎉 Easy, Fast, and Cheap LLM Serving for Everyone vLLM has always had a strong connection with the PyTorch project. It…
Liked by Zhoutong Fu
-
The OpenAI Sora UI is made for Gen-Z! It is very similar to CapCut, TikTok, or Instagram Reels. That's a genius customer acquisition! Every young…
The OpenAI Sora UI is made for Gen-Z! It is very similar to CapCut, TikTok, or Instagram Reels. That's a genius customer acquisition! Every young…
Liked by Zhoutong Fu
-
Llama Update! 🎉 Meta just released Llama 3.3 70B with OpenAI GPT-4o and Anthropic Claude Haiku 3.5 performance on Hugging Face! Achieved through…
Llama Update! 🎉 Meta just released Llama 3.3 70B with OpenAI GPT-4o and Anthropic Claude Haiku 3.5 performance on Hugging Face! Achieved through…
Liked by Zhoutong Fu
-
Can KV cache optimization go beyond just reducing memory footprint? Our latest work, SwiftKV, does exactly that—cutting prefill computation by half…
Can KV cache optimization go beyond just reducing memory footprint? Our latest work, SwiftKV, does exactly that—cutting prefill computation by half…
Liked by Zhoutong Fu
-
🔥 NVLink Distributed GEMMs natively in CUTLASS for all your tensor parallelism needs 🔥 https://lnkd.in/eJWWKmYA Work done by Ali Hassani over…
🔥 NVLink Distributed GEMMs natively in CUTLASS for all your tensor parallelism needs 🔥 https://lnkd.in/eJWWKmYA Work done by Ali Hassani over…
Liked by Zhoutong Fu
-
🌟 Thrilled to share that our tutorial proposal titled "Efficient algorithms for leveraging LLMs for Generative and Predictive Recommender Systems"…
🌟 Thrilled to share that our tutorial proposal titled "Efficient algorithms for leveraging LLMs for Generative and Predictive Recommender Systems"…
Liked by Zhoutong Fu
Other similar profiles
Explore collaborative articles
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
Explore More