Skip to content

Issues: huggingface/optimum-neuron

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Move neuron_parallel_compile outside of bash script
#706 opened Sep 26, 2024 by jgray-aws updated Sep 26, 2024
can't compile llama-3-8B or llama-3.1-8B with lora if batch size is more than 1 bug Something isn't working
#709 opened Oct 5, 2024 by anilozlu updated Oct 7, 2024
3 of 4 tasks
Codellama generates wierd tokens with TGI 0.0.24 bug Something isn't working
#704 opened Sep 25, 2024 by pinak-p updated Oct 8, 2024
1 of 4 tasks
Training output reports incorrect num examples when using DDP bug Something isn't working Stale
#683 opened Aug 24, 2024 by syl-taylor-aws updated Oct 14, 2024
2 of 4 tasks
MPMD errors when enabling pipeline parallel for fine-tuning llama 3 8B model bug Something isn't working Stale
#674 opened Jul 31, 2024 by bingchen-liu updated Oct 14, 2024
2 of 4 tasks
Underloaded Neuron Cores with Llama3 bug Something isn't working Stale
#672 opened Jul 30, 2024 by dlptv updated Oct 14, 2024
2 of 4 tasks
Llama3-8B finetuning shows runtime error of TDRV:v2_cc_execute bug Something isn't working Stale
#658 opened Jul 17, 2024 by jianyinglangaws updated Oct 14, 2024
Llama 3 8B fine tuning shows nan value as loss bug Something isn't working Stale
#660 opened Jul 20, 2024 by BaiqingL updated Oct 14, 2024
2 of 4 tasks
Enable use of IterableDataset when training with DDP
#681 opened Aug 23, 2024 by syl-taylor-aws updated Nov 11, 2024
AttributeError: can't set attribute 'deepspeed_plugin' bug Something isn't working
#735 opened Nov 14, 2024 by anushka0415 updated Nov 19, 2024
2 of 4 tasks
Size mismatch while loading consolidated checkpoints trained with Tensor parallelism for custom LLama Model bug Something isn't working
#734 opened Nov 9, 2024 by unography updated Nov 20, 2024
3 of 4 tasks
HF_HUB_OFFLINE environment variable not being honoured for Neuron cache bug Something isn't working
#741 opened Nov 24, 2024 by unography updated Nov 24, 2024
2 of 4 tasks
stablediffusion (sdxl) ip-adapter support
#718 opened Oct 16, 2024 by Suprhimp updated Nov 29, 2024
Loading compiled fails: model_type=bert -> transformers being used in compiled config. bug Something isn't working
#744 opened Dec 2, 2024 by michaelfeil updated Dec 3, 2024
Support for Qwen2-VL
#747 opened Dec 9, 2024 by Chin-Vic updated Dec 9, 2024
Add support for new Black Forest's model (Flux)
#676 opened Aug 6, 2024 by mrrfr updated Dec 9, 2024
ProTip! Mix and match filters to narrow down what you’re looking for.