Actions: huggingface/trl

Showing runs from all workflows
20,589 workflow runs

simple question: SFTTrainer ValueError
Hugging Face Issue Labeler #185: Issue #2920 opened by jbw3016
February 21, 2025 02:11 30s
[WIP] [Liger] Liger KTO support
Build PR Documentation #6684: Pull request #2812 synchronize by vaibhavjindal
February 21, 2025 01:32 Action required vaibhavjindal:liger-kto
[WIP] [Liger] Liger KTO support
Tests #7570: Pull request #2812 synchronize by vaibhavjindal
February 21, 2025 01:32 Action required vaibhavjindal:liger-kto
Build Docker images (scheduled)
Build Docker images (scheduled) #409: Scheduled
February 21, 2025 01:30 10m 55s main
Add GRPO Trainer support for third-party accelerators
Build PR Documentation #6683: Pull request #2836 synchronize by ji-huazhong
February 21, 2025 01:01 Action required ji-huazhong:npu
Add GRPO Trainer support for third-party accelerators
Tests #7569: Pull request #2836 synchronize by ji-huazhong
February 21, 2025 01:01 Action required ji-huazhong:npu
Fine tuning "thinking"/"reasoning" models
Hugging Face Issue Labeler #184: Issue #2919 opened by GhostDog98
February 21, 2025 00:47 35s
Tests latest TRL release with dev dependencies
Tests latest TRL release with dev dependencies #79: Scheduled
February 21, 2025 00:18 38m 14s main
Cleanup Cache
Cleanup Cache #698: Scheduled
February 21, 2025 00:04 14s main
updated DPO default values for alpha and tau
Build PR Documentation #6682: Pull request #2918 opened by Ishan-Kumar2
February 20, 2025 22:44 Action required Ishan-Kumar2:update_alpha_trdpo
updated DPO default values for alpha and tau
Tests #7568: Pull request #2918 opened by Ishan-Kumar2
February 20, 2025 22:44 Action required Ishan-Kumar2:update_alpha_trdpo
GRPO from VLM models?
Hugging Face Issue Labeler #183: Issue #2917 opened by dipta007
February 20, 2025 19:10 32s
pages build and deployment
pages-build-deployment #1158: by qgallouedec
February 20, 2025 18:51 37s main
I want to solve this issue: ValueError: Unable to create tensor
Hugging Face Issue Labeler #182: Issue #2916 opened by jbw3016
February 20, 2025 17:09 37s
Upload PR Documentation
Upload PR Documentation #4802: completed by qgallouedec
February 20, 2025 16:34 33s
🐦‍🔥 6x faster GRPO with multi-step optimization
Build PR Documentation #6681: Pull request #2899 synchronize by qgallouedec
February 20, 2025 16:29 5m 34s multi-step-grpi
🐦‍🔥 6x faster GRPO with multi-step optimization
Tests #7567: Pull request #2899 synchronize by qgallouedec
February 20, 2025 16:29 40m 28s multi-step-grpi
clarify
Secret Leaks #2501: Commit 3f2219d pushed by qgallouedec
February 20, 2025 16:29 1m 40s multi-step-grpi
SFTTrainer: Why do we always switch to chatML?
Hugging Face Issue Labeler #181: Issue #2915 opened by jbw3016
February 20, 2025 15:42 28s
Clarification on KL Divergence Computation in GRPOTrainer
Hugging Face Issue Labeler #180: Issue #2914 opened by zhaopku
February 20, 2025 14:02 34s
Upload PR Documentation
Upload PR Documentation #4801: completed by qgallouedec
February 20, 2025 13:29 38s
🐦‍🔥 6x faster GRPO with multi-step optimization
Build PR Documentation #6680: Pull request #2899 synchronize by qgallouedec
February 20, 2025 13:26 3m 32s multi-step-grpi
🐦‍🔥 6x faster GRPO with multi-step optimization
Tests #7566: Pull request #2899 synchronize by qgallouedec
February 20, 2025 13:26 34m 12s multi-step-grpi