Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Nice Work on CodeRM but Many Missing Citations #1

Open
ashwin296 opened this issue Feb 7, 2025 · 2 comments
Open

Nice Work on CodeRM but Many Missing Citations #1

ashwin296 opened this issue Feb 7, 2025 · 2 comments

Comments

@ashwin296
Copy link

ashwin296 commented Feb 7, 2025

This is a really nice work that contributes to new reward models in the coding domain.

However, many relevant prior works are not cited. The claim that "very few models have explored the potential of reinforcement learning" in the introduction is misleading (currently, only CodeRL is cited in Section 5.4).

To name a few:

It would be great to clarify the contributions to reward model training (which are quite valuable) and adjust the phrasing to better reflect prior works.

Thanks again for open-sourcing!

-- a Reinforcement Learner : )

@wenhuchen
Copy link
Collaborator

Thanks for the reminder. We have read these insightful papers. We will compare with them in the paper.

@ashwin296
Copy link
Author

And PG-TD, where there are value estimates.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants