Issues: huggingface/optimum-neuron
Issues list
ValueError: The NeuronTrainer only accept NeuronTrainingArguments, but <class 'optimum.neuron.training_args.Seq2SeqNeuronTrainingArguments'> was provided.
bug
Something isn't working
#693
opened Sep 6, 2024 by
industrialeaf
updated Sep 18, 2024
2 of 4 tasks
Move neuron_parallel_compile outside of bash script
#706
opened Sep 26, 2024 by
jgray-aws
updated Sep 26, 2024
can't compile llama-3-8B or llama-3.1-8B with lora if batch size is more than 1
bug
Something isn't working
#709
opened Oct 5, 2024 by
anilozlu
updated Oct 7, 2024
3 of 4 tasks
Codellama generates weird tokens with TGI 0.0.24
bug
Something isn't working
#704
opened Sep 25, 2024 by
pinak-p
updated Oct 8, 2024
1 of 4 tasks
Training output reports incorrect num examples when using DDP
bug
Something isn't working
Stale
#683
opened Aug 24, 2024 by
syl-taylor-aws
updated Oct 14, 2024
2 of 4 tasks
MPMD errors when enabling pipeline parallel for fine-tuning llama 3 8B model
bug
Something isn't working
Stale
#674
opened Jul 31, 2024 by
bingchen-liu
updated Oct 14, 2024
2 of 4 tasks
Underloaded Neuron Cores with Llama3
bug
Something isn't working
Stale
#672
opened Jul 30, 2024 by
dlptv
updated Oct 14, 2024
2 of 4 tasks
Llama3-8B finetuning shows runtime error of TDRV:v2_cc_execute
bug
Something isn't working
Stale
#658
opened Jul 17, 2024 by
jianyinglangaws
updated Oct 14, 2024
Llama 3 8B fine tuning shows nan value as loss
bug
Something isn't working
Stale
#660
opened Jul 20, 2024 by
BaiqingL
updated Oct 14, 2024
2 of 4 tasks
Cannot host Llama-3-8B exported by optimum-neuron with TGI container using optimum-neuron(0.0.24) and neuron-sdk(2.19.1)
bug
Something isn't working
#684
opened Aug 25, 2024 by
cszhz
updated Oct 14, 2024
2 of 4 tasks
Enable use of IterableDataset when training with DDP
#681
opened Aug 23, 2024 by
syl-taylor-aws
updated Nov 11, 2024
AttributeError: can't set attribute 'deepspeed_plugin'
bug
Something isn't working
#735
opened Nov 14, 2024 by
anushka0415
updated Nov 19, 2024
2 of 4 tasks
Size mismatch while loading consolidated checkpoints trained with Tensor parallelism for custom Llama Model
bug
Something isn't working
#734
opened Nov 9, 2024 by
unography
updated Nov 20, 2024
3 of 4 tasks
HF_HUB_OFFLINE environment variable not being honoured for Neuron cache
bug
#741
opened Nov 24, 2024 by
unography
updated Nov 24, 2024
2 of 4 tasks
Loading compiled fails: model_type=bert -> transformers being used in compiled config.
bug
Something isn't working
#744
opened Dec 2, 2024 by
michaelfeil
updated Dec 3, 2024
Add support for new Black Forest's model (Flux)
#676
opened Aug 6, 2024 by
mrrfr
updated Dec 9, 2024