Write simple additional performance tests #105

Draft · wants to merge 4 commits into base: main

Conversation


@spigo900 spigo900 commented Dec 4, 2024

Addresses part of #45.

cc @benlebrun

(locally, anyway)

This counts the overall test time, including the time to load the model. If
I find a way to load the model once for the entire benchmark, that overhead
will go away and I can scale up the test "size" again.

spigo900 commented Dec 4, 2024

@benlebrun As it stands:

  1. I run the tests using `make benchmark` to run them all, or `pytest perf_tests/test_inference -k test_name_substring` to run a single test.
  2. The long sequences test should be short enough -- it ran in 1 minute 45 seconds locally, total, when I ran only that test.
  3. The permissive grammar test takes ~5 minutes as it stands, so I would shorten the token limit from 100 tokens to 40 and adjust from there.
  4. I don't know where the "many particles" benchmark stands.
  5. The benchmarks seem to take longer than the benchmark printout reports, probably because of time spent loading the model. Putting model loading into a fixture might fix this, but I don't know if there is a nice way to do that, because each test needs its own grammar in the inference setup.

So, the tests are in place but the parameters still need adjusting, and we might be able to increase the test parameters if we can factor out the model loading/inference setup somehow.
