-
Notifications
You must be signed in to change notification settings - Fork 33
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
some questions #83
Comments
Yes, unless you just want to check the ground-truth pass rate. To evaluate any models, you need to do the generation first. |
I already have results generated by other models, and now I want to score them. Can I only use the https://bigcode-bigcodebench-evaluator.hf.space/ you provided to get the scores? |
Regarding the gradio (HF space) endpoint, please refer to the following note:
For any other execution methods, please refer to ADVANCED_USAGE., where you can choose the execution from |
OK, thanks |
I want to ask if only https://bigcode-bigcodebench-evaluator.hf.space/ can be used to generate scores after the results are generated.
The text was updated successfully, but these errors were encountered: