Issues: huggingface/transformers
Issues list
Issues occurring during parallel evaluation (using Trainer.evaluate)
#30767
opened May 12, 2024 by
psychocosine
2 of 4 tasks
For multiple GPUs: torch.cuda.empty_cache() stuck forever
#30766
opened May 11, 2024 by
animeshkumarpaul
2 of 4 tasks
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained
#30762
opened May 11, 2024 by
yingqianch
4 tasks
Convert Helsinki-NLP model to huggingface
New model
#30761
opened May 11, 2024 by
nichellehouston
2 tasks
BART generate with min_new_tokens exceeds maximum length
#30759
opened May 11, 2024 by
vsocrates
2 of 4 tasks
TokenClassificationPipeline support is_split_into_words tokeniser parameter
#30757
opened May 11, 2024 by
swtb3
TypeError: 'list' object is not callable || Resume from checkpoint
#30754
opened May 11, 2024 by
satpalsr
4 tasks
recent version of Transformers seems to mess with forward/__call__. Breaks patching loss function
#30753
opened May 10, 2024 by
grahamannett
3 of 4 tasks
train_new_from_iterator does not properly modify the tokenizer's postprocessor's ids when using a Sequence postprocessor
#30752
opened May 10, 2024 by
dmcinerney
1 of 4 tasks
BitsNBytes 4 bit quantization error message typo and logical errors in error message handling
#30751
opened May 10, 2024 by
jkterry1
4 tasks
Bug: InformerModel, decoder_input torch.cat size of tensor mismatch error otherwise
#30750
opened May 10, 2024 by
jhzsquared
[Batched Whisper] ValueError on input mel features
Audio
#30740
opened May 10, 2024 by
kerem0comert
2 of 4 tasks
[DOCS] - Model outputs of RecurrentGemmaCausalLM doesn't align with the documentation
#30736
opened May 10, 2024 by
godjw
4 tasks
Meet problems when I use the file src/transformers/models/llama/convert_llama_weights_to_hf.py to transfer LlaMa-7B
#30734
opened May 9, 2024 by
wwxxyy1996
2 of 4 tasks
Mixtral past_key_values and output_router_logits incompatible
#30731
opened May 9, 2024 by
sorgfresser
2 of 4 tasks
Support for Multiple Datasets and Domain-Specific Loss Calculation in Trainer
Feature request
trainer
#30725
opened May 9, 2024 by
Ajmalshamsudheen
hub_strategy="every_save" won't push the model to the Hub if large
#30724
opened May 9, 2024 by
alvarobartt
2 of 4 tasks
Add TableTransformerImageProcessor
Feature request
Vision
#30718
opened May 8, 2024 by
NielsRogge
Is model.generate supported during the training process?
#30713
opened May 8, 2024 by
sunxiaojie99
2 of 4 tasks