-
Notifications
You must be signed in to change notification settings - Fork 69
Pull requests: huggingface/optimum-quanto
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[
core
/ FEAT] Early version of pack / unpack int4 / int2 weights in uint8
#70
by younesbelkada
was merged Jan 19, 2024
Loading…
1 of 2 tasks
Benchmarks: Add missing latency and accuracy benchmarks
#109
by younesbelkada
was merged Mar 6, 2024
Loading…
feat(example): add quantize stablediffusion example
#181
by thliang01
was merged Apr 17, 2024
Loading…
3 tasks done
Avoid composite gradients in quantized linear function
#187
by dacorvo
was merged Apr 23, 2024
Loading…
feat: implement load and save support from the Hub.
#263
by sayakpaul
was merged Aug 22, 2024
Loading…
docs: fix typo in file name s/READMD.md/README.md/
#268
by dvrogozh
was merged Aug 21, 2024
Loading…
Fix a bug that prevents specifying include patterns for quantifying
#271
by kaibioinfo
was merged Aug 14, 2024
Loading…
Suggestion for https://github.com/huggingface/optimum-quanto/pull/263
#279
by Wauplin
was merged Aug 14, 2024
Loading…
fix: adjust _convert_weight_to_int4pack_cpu input weights for pytorch>=2.5
#286
by dvrogozh
was merged Aug 20, 2024
Loading…
FIX: Enable non-strict loading of state dicts
#295
by BenjaminBossan
was merged Aug 27, 2024
Loading…
3 of 4 tasks
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.