Actions: huggingface/trl

Showing runs from all workflows
20,586 workflow runs

Tests latest TRL release with dev dependencies
Tests latest TRL release with dev dependencies #79: Scheduled
February 21, 2025 00:18 38m 14s main
Cleanup Cache
Cleanup Cache #698: Scheduled
February 21, 2025 00:04 14s main
updated DPO default values for alpha and tau
Build PR Documentation #6682: Pull request #2918 opened by Ishan-Kumar2
February 20, 2025 22:44 Action required Ishan-Kumar2:update_alpha_trdpo
updated DPO default values for alpha and tau
Tests #7568: Pull request #2918 opened by Ishan-Kumar2
February 20, 2025 22:44 Action required Ishan-Kumar2:update_alpha_trdpo
GRPO from VLM models?
Hugging Face Issue Labeler #183: Issue #2917 opened by dipta007
February 20, 2025 19:10 32s
pages build and deployment
pages-build-deployment #1158: by qgallouedec
February 20, 2025 18:51 37s main
I want to solve this issue: ValueError: Unable to create tensor
Hugging Face Issue Labeler #182: Issue #2916 opened by jbw3016
February 20, 2025 17:09 37s
Upload PR Documentation
Upload PR Documentation #4802: completed by qgallouedec
February 20, 2025 16:34 33s
🐦‍🔥 6x faster GRPO with multi-step optimization
Build PR Documentation #6681: Pull request #2899 synchronize by qgallouedec
February 20, 2025 16:29 5m 34s multi-step-grpi
🐦‍🔥 6x faster GRPO with multi-step optimization
Tests #7567: Pull request #2899 synchronize by qgallouedec
February 20, 2025 16:29 40m 28s multi-step-grpi
clarify
Secret Leaks #2501: Commit 3f2219d pushed by qgallouedec
February 20, 2025 16:29 1m 40s multi-step-grpi
SFTTrainer: Why do we always switch to chatML?
Hugging Face Issue Labeler #181: Issue #2915 opened by jbw3016
February 20, 2025 15:42 28s
Clarification on KL Divergence Computation in GRPOTrainer
Hugging Face Issue Labeler #180: Issue #2914 opened by zhaopku
February 20, 2025 14:02 34s
Upload PR Documentation
Upload PR Documentation #4801: completed by qgallouedec
February 20, 2025 13:29 38s
🐦‍🔥 6x faster GRPO with multi-step optimization
Build PR Documentation #6680: Pull request #2899 synchronize by qgallouedec
February 20, 2025 13:26 3m 32s multi-step-grpi
🐦‍🔥 6x faster GRPO with multi-step optimization
Tests #7566: Pull request #2899 synchronize by qgallouedec
February 20, 2025 13:26 34m 12s multi-step-grpi
beta position
Secret Leaks #2500: Commit a8cc776 pushed by qgallouedec
February 20, 2025 13:26 19s multi-step-grpi
Upload PR Documentation
Upload PR Documentation #4800: completed by qgallouedec
February 20, 2025 13:17 29s
🐦‍🔥 6x faster GRPO with multi-step optimization
Build PR Documentation #6679: Pull request #2899 synchronize by qgallouedec
February 20, 2025 13:13 3m 24s multi-step-grpi
🐦‍🔥 6x faster GRPO with multi-step optimization
Tests #7565: Pull request #2899 synchronize by qgallouedec
February 20, 2025 13:13 36m 47s multi-step-grpi
test
Secret Leaks #2499: Commit 7ab20eb pushed by qgallouedec
February 20, 2025 13:13 22s multi-step-grpi
Upload PR Documentation
Upload PR Documentation #4799: completed by qgallouedec
February 20, 2025 13:11 27s
🐦‍🔥 6x faster GRPO with multi-step optimization
Build PR Documentation #6678: Pull request #2899 synchronize by qgallouedec
February 20, 2025 13:07 3m 16s multi-step-grpi
🐦‍🔥 6x faster GRPO with multi-step optimization
Tests #7564: Pull request #2899 synchronize by qgallouedec
February 20, 2025 13:07 40m 46s multi-step-grpi
February 20, 2025 13:07 18s