-
Notifications
You must be signed in to change notification settings - Fork 28.1k
Issues: huggingface/transformers
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
SamImageProcessor do_convert_rgb not working correctly
bug
#36368
opened Feb 24, 2025 by
MSt-10
2 of 4 tasks
目前使用Ktransformers进行DEEPSEEK-R1满血版和4bit量化版模型进行推理,推理速度有多少tokens/s?对应的计算资源配置分别是多少?
#36363
opened Feb 24, 2025 by
William-Cai123
The arguments in
utils/modular_model_converter.py
is different from those in docs
#36362
opened Feb 24, 2025 by
zhoubay
warning bug in Qwen2DecoderLayer in transformers ==4.49
bug
#36361
opened Feb 24, 2025 by
Kyrie666
2 of 4 tasks
Are there any plans to provide some performance analysis tools for transformers?
Feature request
Request for a new feature
#36360
opened Feb 24, 2025 by
Hukongtao
Allow setting a seed for DataCollatorForLanguageModeling
Feature request
Request for a new feature
#36357
opened Feb 23, 2025 by
capemox
Groq inference provider
Feature request
Request for a new feature
#36353
opened Feb 23, 2025 by
VladOS95-cyber
Implement Titans Architecture with GRPO Fine-Tuning
New model
#36352
opened Feb 23, 2025 by
rajveer43
2 tasks
Support sliding_window for sdpa in qwen2
Feature request
Request for a new feature
#36351
opened Feb 23, 2025 by
cyr0930
Failed to import transformers.models.auto.modeling_auto because numpy.core.multiarray failed to import
bug
#36343
opened Feb 22, 2025 by
forrestbao
4 tasks
suppress_tokens=[] should be legal as some older. whisper models rely on this
bug
#36341
opened Feb 22, 2025 by
Lewington-pitsos
2 of 4 tasks
AudioClassificationPipelineTests::test_small_model_pt_fp16 fails for CUDA/XPU but passes for CPU
#36340
opened Feb 21, 2025 by
dvrogozh
Making apply_rotary_pos_emb generic
Feature request
Request for a new feature
#36339
opened Feb 21, 2025 by
xuzifei-dmatrix
Assisted generation slower than with base model alone
bug
#36337
opened Feb 21, 2025 by
sahilsuneja1
2 of 4 tasks
Some of test/utils tests fail being invalidated by tests/utils/test_import_utils.py::test_clear_import_cache
#36334
opened Feb 21, 2025 by
dvrogozh
TypeError: CustomTrainer.compute_loss() got an unexpected keyword argument 'num_items_in_batch'
bug
#36331
opened Feb 21, 2025 by
ruidazeng
2 of 4 tasks
Unable to use Seq2SeqTrainingArguments and Seq2SeqTrainer
bug
#36330
opened Feb 21, 2025 by
shivanraptor
3 of 4 tasks
Question for community: We're considering adding
pydantic
as a base requirement to 🤗 transformers
contributions-welcome
#36329
opened Feb 21, 2025 by
gante
llama tokenizer encode -> decode is not same
bug
#36325
opened Feb 21, 2025 by
pocca2048
2 of 4 tasks
DS3 zero3_save_16bit_model is not compatible with resume_from_checkpoint
bug
#36317
opened Feb 21, 2025 by
starmpcc
2 of 4 tasks
Include "time" as option to save_strategy (and log and eval too!)
Feature request
Request for a new feature
#36310
opened Feb 20, 2025 by
davidhughhenrymack
Bug about num_update_steps_per_epoch in function _inner_training_loop
bug
#36297
opened Feb 20, 2025 by
onenotell
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.