diff --git a/template.html b/template.html index 2999500..cd32316 100644 --- a/template.html +++ b/template.html @@ -233,7 +233,8 @@
Every day, we run a set of tests to evaluate how GPT-4 Vision (GPT-4V) performs over time. These tests are designed to monitor core features of GPT-4V.
-Each test runs the same prompt and image through GPT-4V and compares the Result to a human-written Result. While making this website, we experimented with prompts and chose the prompt that gave the most accurate results.
+Each test runs the same prompt and image through GPT-4V and compares the result to a human-written result. While making this website, we experimented with prompts and chose the prompt that gave the most accurate results.
+There may be other prompts that can solve a given query. With that said, we cannot test every possible prompt. This site is designed to act as a reference; different prompts may achieve better or worse results.
Tests are run at 1am PT every day. This site is updated when all tests are complete.
If a line is red, it means the test failed that day; if a line is green, the test passed.