-
Notifications
You must be signed in to change notification settings - Fork 28.5k
Issues: huggingface/transformers
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
FastAPI with LLM inference does not release accumulated VRAM
bug
#37118
opened Mar 31, 2025 by
variable
4 tasks
The loss and gradient explosion caused by the trainer
bug
#37110
opened Mar 30, 2025 by
rangehow
1 of 4 tasks
Add Sdpa Support for Request for a new feature
Electra
Feature request
#37105
opened Mar 29, 2025 by
nnilayy
Feature Request: Support Canary Models
Feature request
Request for a new feature
#37098
opened Mar 29, 2025 by
fakerybakery
Release Tag Changed, Breaking Checksums, and AUR Package Building
#37090
opened Mar 28, 2025 by
daskol
LLaVa_mistral models are unrecognized
bug
New model
#37087
opened Mar 28, 2025 by
darshpatel1052
2 of 4 tasks
Do not update cache when use_cache=False and past_key_values are provided?
Feature request
Request for a new feature
#37078
opened Mar 28, 2025 by
PheelaV
A TypeError in modeling_utils.caching_allocator_warmup function
bug
#37074
opened Mar 28, 2025 by
ZeroMakesAll
2 of 4 tasks
a logic error in _preprocess function of Qwen2VLImageProcessor Class
bug
#37064
opened Mar 28, 2025 by
InsaneGe
4 tasks
Incorrect calculation of strides leading to loss of param data upon tensor parallel use while sliced model loading
bug
#37051
opened Mar 27, 2025 by
kmehant
1 of 4 tasks
Persistent generation issues with MT5 models (base and fine-tuned) across environments
#37048
opened Mar 27, 2025 by
Elpharran
Optionality of
attention_mask
argument in Attention classes/functions.
#37046
opened Mar 27, 2025 by
Godofnothing
run_mim.py script from image-pretraining example is not working
bug
#37020
opened Mar 26, 2025 by
jafraustro
1 of 4 tasks
SwitchTransformer: Initialization of tensor to collect expert results is incorrect for dropped tokens (from ML POV)
bug
#37017
opened Mar 26, 2025 by
mario-aws
2 of 4 tasks
Gemma3 adding new tokens <image_soft_token> has been added accidentally
bug
#37011
opened Mar 26, 2025 by
Serzhanov
4 tasks
GGUF model with architecture gemma3 is not supported yet.
bug
#37002
opened Mar 26, 2025 by
chunxingque
4 tasks
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.