Link to the main repository: https://github.com/GammaTauAI/opentau
Link to the dataset used for evaluation: https://huggingface.co/datasets/nuprl/ts-eval
Configuration | Percent Typecheck | Avg Heuristic | Avg type errors |
---|---|---|---|
Santacoder TS FT - Baseline - 2048 | 245/744 (32.9%) | 200.7 | 4.7 |
Santacoder FIT FT - Baseline - 2048 | 297/744 (39.9%) | 200.9 | 5.2 |
Santacoder FIT FT - Baseline - 1024 | 248/744 (33.3%) | 200.7 | 5.1 |
Santacoder FIT FT - Baseline - 512 | 178/744 (23.9%) | 201.2 | 6.3 |
Santacoder FIT FT - Tree, no usages - 2048 | 274/744 (36.8%) | 168.4 | 3.7 |
Santacoder FIT FT - Tree, usages - 2048 | 353/744 (47.4%) | 154.6 | 3.3 |
Configuration | File name in the repository |
---|---|
Santacoder TS FT - Baseline - 2048 | santacoder_baseline_finetuned_ts_ts-eval.json |
Santacoder FIT FT - Baseline - 2048 | santacoder_baseline_finetuned_fim_2048_ts-eval.json |
Santacoder FIT FT - Baseline - 1024 | santacoder_baseline_finetuned_fim_1024_ts-eval.json |
Santacoder FIT FT - Baseline - 512 | santacoder_baseline_finetuned_fim_512_ts-eval.json |
Santacoder FIT FT - Tree, no usages | santacoder_tree_nousages_ts-eval.json |
Santacoder FIT FT - Tree, usages | santacoder_tree_ts-eval.json |
- Metrics in bold are the best results for each column.
- All rows for the 744 runs had 0 average syntax errors, thus we did not include the column in the table.
- FIT FT means that the model was fine-tuned for Fill-In-the-Type, for more information about the technique, please refer to the repository: https://github.com/GammaTauAI/santacoder-finetuning
- TS FT means that the model was fine-tuned for TypeScript code generation, NOT for Fill-In-the-Type (FIT) or Fill-In-the-Middle (FIM).
- The numbers at the end (e.g. 2048, 1024, 512) are the maximum number of characters for the context window when running inference with the model.
Both rows used the Baseline and the maximum context window size was 2048 characters.
Configuration | Percent Typecheck | Avg Heuristic | Avg type errors | Avg syntax errors |
---|---|---|---|---|
Santacoder TS FT | 1/50 (2.0%) | 0 | 121.2 | 42.1 |
Santacoder FIT FT | 25/50 (50.0%) | 230.0 | 4.6 | 0.2 |
Configuration | File name in the repository |
---|---|
Santacoder TS FT | santacoder_baseline_noparser_normal_ts-eval-subset.json |
Santacoder FIT FT | santacoder_baseline_noparser_fim_finetuned_ts-eval-subset.json |