-
Notifications
You must be signed in to change notification settings - Fork 65
Issues: huggingface/optimum-neuron
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Loading compiled fails: Something isn't working
model_type=bert -> transformers
being used in compiled config.
bug
#744
opened Dec 2, 2024 by
michaelfeil
HF_HUB_OFFLINE
environment variable not being honoured for Neuron cache
bug
#741
opened Nov 24, 2024 by
unography
2 of 4 tasks
AttributeError: can't set attribute 'deepspeed_plugin'
bug
Something isn't working
#735
opened Nov 14, 2024 by
anushka0415
2 of 4 tasks
Size mismatch while loading consolidated checkpoints trained with Tensor parallelism for custom LLama Model
bug
Something isn't working
#734
opened Nov 9, 2024 by
unography
3 of 4 tasks
can't compile llama-3-8B or llama-3.1-8B with lora if batch size is more than 1
bug
Something isn't working
#709
opened Oct 5, 2024 by
anilozlu
3 of 4 tasks
Codellama generates wierd tokens with TGI 0.0.24
bug
Something isn't working
#704
opened Sep 25, 2024 by
pinak-p
1 of 4 tasks
ValueError: The NeuronTrainer only accept NeuronTrainingArguments, but <class 'optimum.neuron.training_args.Seq2SeqNeuronTrainingArguments'> was provided.
bug
Something isn't working
#693
opened Sep 6, 2024 by
industrialeaf
2 of 4 tasks
Cannot host Llama-3-8B exported by optimum-neuron with TGI contianer using optimum-neuron(0.0.24) and neuron-sdk(2.19.1)
bug
Something isn't working
#684
opened Aug 25, 2024 by
cszhz
2 of 4 tasks
Training output reports incorrect num examples when using DDP
bug
Something isn't working
Stale
#683
opened Aug 24, 2024 by
syl-taylor-aws
2 of 4 tasks
MPMD errors when enabling pipeline parallel for fine-tuning llama 3 8B model
bug
Something isn't working
Stale
#674
opened Jul 31, 2024 by
bingchen-liu
2 of 4 tasks
Underloaded Neuron Cores with Llama3
bug
Something isn't working
Stale
#672
opened Jul 30, 2024 by
dlptv
2 of 4 tasks
Llama 3 8B fine tuning shows nan value as loss
bug
Something isn't working
Stale
#660
opened Jul 20, 2024 by
BaiqingL
2 of 4 tasks
Llama3-8B finetuning shows runtime error of TDRV:v2_cc_execute
bug
Something isn't working
Stale
#658
opened Jul 17, 2024 by
jianyinglangaws
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.