-
Notifications
You must be signed in to change notification settings - Fork 28.6k
Pull requests: huggingface/transformers
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Llama4: remove redundant transpose of router_logits
#37468
opened Apr 12, 2025 by
pbelevich
Loading…
fix: (llama4) fix no_split_modules to be picked up for fsdpv1 and v2 sharding
#37462
opened Apr 12, 2025 by
kmehant
Loading…
1 of 5 tasks
Fix interpolation of convnext image processor
#37460
opened Apr 11, 2025 by
chandrusuresh
Loading…
1 of 5 tasks
Remove torchvision requirement from AutoImageProcessor
#37457
opened Apr 11, 2025 by
LysandreJik
Loading…
[bug] deprecated deta load_cuda_kernel, MultiScaleDeformableAttention
#37443
opened Apr 11, 2025 by
chagmgang
Loading…
5 tasks
fix issue that some example with no trainer use accelerator.end_train…
#37435
opened Apr 10, 2025 by
we1559
Loading…
2 of 5 tasks
convert scale and zero to cuda when using HQQ backend
#37425
opened Apr 10, 2025 by
phymhan
Loading…
2 of 5 tasks
guard on model.eval when using torch.compile + FSDP2
#37413
opened Apr 10, 2025 by
winglian
Loading…
5 tasks
chore: standardize DeBERTa model card
#37409
opened Apr 10, 2025 by
Shoumik-Gandre
•
Draft
5 tasks done
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.