You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is a really nice work that contributes to new reward models in the coding domain.
However, many relevant prior works are not cited. The claim that "very few models have explored the potential of reinforcement learning" in the introduction is misleading (currently, only CodeRL is cited in Section 5.4).
It would be great to clarify the contributions to reward model training (which are quite valuable) and adjust the phrasing to better reflect prior works.
Thanks again for open-sourcing!
-- a Reinforcement Learner : )
The text was updated successfully, but these errors were encountered:
This is a really nice work that contributes to new reward models in the coding domain.
However, many relevant prior works are not cited. The claim that "very few models have explored the potential of reinforcement learning" in the introduction is misleading (currently, only CodeRL is cited in Section 5.4).
To name a few:
It would be great to clarify the contributions to reward model training (which are quite valuable) and adjust the phrasing to better reflect prior works.
Thanks again for open-sourcing!
-- a Reinforcement Learner : )
The text was updated successfully, but these errors were encountered: