Skip to content

Pull requests: huggingface/optimum-quanto

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

fix(library): Propagate upstream Marlin kernel fix Stale
#366 by ahadnagy was closed Jan 29, 2025 Loading…
3 of 4 tasks
fix(library): only compile CUDA extension on Linux
#365 by dacorvo was merged Jan 10, 2025 Loading…
Add repr for QuantizedTransformersModel
#357 by imba-tjd was merged Nov 25, 2024 Loading…
2 of 4 tasks
enable qbitstensor test on xpu
#350 by dacorvo was merged Nov 12, 2024 Loading…
[tests] enable testing for xpu (rebased)
#349 by dacorvo was merged Nov 12, 2024 Loading…
[tests] enable test_weight_qbits_tensor_linear_cuda on xpu devices
#345 by faaany was closed Nov 12, 2024 Loading…
2 of 4 tasks
[tests] enable testing for xpu
#344 by faaany was closed Nov 12, 2024 Loading…
2 of 4 tasks
Support QLayerNorm without weights
#341 by dacorvo was merged Oct 29, 2024 Loading…
fix: use reshape instead of view
#338 by dacorvo was merged Oct 24, 2024 Loading…
Switched linters, black -> ruff
#334 by ishandeva was merged Oct 8, 2024 Loading…
3 of 4 tasks
Add marlin int4 kernel
#333 by dacorvo was merged Oct 10, 2024 Loading…
Add hip support
#330 by dacorvo was merged Oct 4, 2024 Loading…
Refactor extensions
#329 by dacorvo was merged Oct 4, 2024 Loading…
Remove overheads in library
#328 by dacorvo was merged Oct 3, 2024 Loading…
Fix lumina
#326 by dacorvo was merged Oct 1, 2024 Loading…
Fix missing call in QuantizedTransformersModel
#325 by dacorvo was merged Sep 30, 2024 Loading…
refactor(library): reduce overhead in marlin op
#323 by dacorvo was merged Sep 30, 2024 Loading…
Ci move
#321 by glegendre01 was merged Sep 27, 2024 Loading…
4 tasks
Stricter optimized tensor tests
#320 by dacorvo was merged Sep 26, 2024 Loading…
chore: minimal python version is 3.9
#318 by dacorvo was merged Sep 25, 2024 Loading…
Refactor AWQ gemm
#317 by dacorvo was merged Sep 25, 2024 Loading…
More refactoring
#316 by dacorvo was merged Sep 20, 2024 Loading…
Add marlin int4 kernel
#315 by dacorvo was closed Sep 27, 2024 Draft
Refactor QBitsTensor subclasses
#314 by dacorvo was merged Sep 20, 2024 Loading…
feat: e4m3fnuz added
#310 by dacorvo was merged Sep 17, 2024 Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.