Reinforcement learning ∩ LLMs, Generative models, Artificial intelligence
- San Francisco, CA
- alecwangcq.github.io
Pinned
- KFAC-Pytorch (Public): PyTorch implementation of KFAC and E-KFAC (Natural Gradient).
- EigenDamage-Pytorch (Public): Code for "EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis" https://arxiv.org/abs/1905.05934
- f-divergence-dpo (Public): Direct preference optimization with f-divergences.
- gd-zhang/Weight-Decay (Public): Regularization, Neural Network Training Dynamics
Python 14
- ssydasheng/Neural-Kernel-Network (Public): Code for "Differentiable Compositional Kernel Learning for Gaussian Processes" https://arxiv.org/abs/1806.04326