Actions: huggingface/trl

All workflows

Showing runs from all workflows
20,587 workflow runs

🪪 Adds profiling decorators for GRPOTrainer (#2889)
Secret Leaks #2491: Commit a92e00e pushed by edbeeching
February 20, 2025 08:57 17s main
🪪 Adds profiling decorators for GRPOTrainer (#2889)
Tests #7557: Commit a92e00e pushed by edbeeching
February 20, 2025 08:57 37m 25s main
🪪 Adds profiling decorators for GRPOTrainer (#2889)
Slow tests (on push) #522: Commit a92e00e pushed by edbeeching
February 20, 2025 08:57 26m 9s main
pages build and deployment
pages-build-deployment #1157: by edbeeching
February 20, 2025 08:57 44s main
Getting an error while using a PEFT model as a reward model in PPO training.
Hugging Face Issue Labeler #178: Issue #2911 opened by Tarak200
February 20, 2025 06:58 31s
Remove CUDA synchronization in mean_token_accuracy
Build PR Documentation #6671: Pull request #2902 synchronize by cyyever
February 20, 2025 03:33 Action required cyyever:sync_point
Remove CUDA synchronization in mean_token_accuracy
Tests #7556: Pull request #2902 synchronize by cyyever
February 20, 2025 03:33 Action required cyyever:sync_point
L447 of GRPO trainer 'num_return_sequences=self.num_generations'
Hugging Face Issue Labeler #177: Issue #2910 opened by zhengqigao
February 20, 2025 01:57 26s
Build Docker images (scheduled)
Build Docker images (scheduled) #408: Scheduled
February 20, 2025 01:30 11m 1s main
Cannot import name 'shard_checkpoint' (possibly deprecated in transformers)
Hugging Face Issue Labeler #176: Issue #2909 opened by anshuln2
February 20, 2025 00:20 37s
Tests latest TRL release with dev dependencies
Tests latest TRL release with dev dependencies #78: Scheduled
February 20, 2025 00:18 24m 44s main
Cleanup Cache
Cleanup Cache #697: Scheduled
February 20, 2025 00:04 21s main
should work now
Secret Leaks #2490: Commit a178fd9 pushed by qgallouedec
February 19, 2025 23:01 22s multi-step-grpi
GRPO: Enable updating the reference model for KL divergence penalty calculation
Hugging Face Issue Labeler #175: Issue #2908 opened by ko-redtruck
February 19, 2025 21:03 28s
What's the GRPOTrainer's error "Floating point exception (core dumped)"?
Hugging Face Issue Labeler #173: Issue #2906 opened by Tuziking
February 19, 2025 15:16 52s
How to use GRPOTrainer to train a LLM for code generation? What is the format of the dataset?
Hugging Face Issue Labeler #172: Issue #2905 opened by xiangxinhello
February 19, 2025 12:38 19s
fix eval sampler
Secret Leaks #2489: Commit 9ce9b98 pushed by qgallouedec
February 19, 2025 11:27 16s multi-step-grpi
update the loss computation
Secret Leaks #2488: Commit 0bf4f23 pushed by qgallouedec
February 19, 2025 11:27 20s multi-step-grpi
test sampler
Secret Leaks #2487: Commit ee19148 pushed by qgallouedec
February 19, 2025 11:26 18s multi-step-grpi
parameterize enable_prefix_caching
Build PR Documentation #6670: Pull request #2900 synchronize by ji-huazhong
February 19, 2025 11:24 4m 40s ji-huazhong:issue-2798
parameterize enable_prefix_caching
Tests #7555: Pull request #2900 synchronize by ji-huazhong
February 19, 2025 11:24 37m 15s ji-huazhong:issue-2798
parameterize enable_prefix_caching
Build PR Documentation #6669: Pull request #2900 synchronize by ji-huazhong
February 19, 2025 11:20 Action required ji-huazhong:issue-2798
parameterize enable_prefix_caching
Tests #7554: Pull request #2900 synchronize by ji-huazhong
February 19, 2025 11:20 Action required ji-huazhong:issue-2798
Save memory when layers are shared with ref model?
Hugging Face Issue Labeler #171: Issue #2904 opened by raphael-sch
February 19, 2025 10:58 37s