Skip to content

Latest commit

 

History

History
134 lines (109 loc) · 8.32 KB

README.md

File metadata and controls

134 lines (109 loc) · 8.32 KB

aoc-fastest

Check below for resources and optimization guides!

Quick Links to Solutions

Results

here's the total of the fastest times for each day so far:

day | part |    time | user            | source available
--- | ---- | ------- | --------------- | ----------------
  1 |    1 |    9150 | doge            | yes
  1 |    2 |    4945 | doge            | yes
  2 |    1 |    3274 | giooschi        | yes
  2 |    2 |    3749 | giooschi        | yes
  3 |    1 |    2138 | alion02         | yes
  3 |    2 |    2391 | ameo            | yes
  4 |    1 |    3636 | giooschi        | yes
  4 |    2 |     691 | bendn           | yes
  5 |    1 |    5467 | giooschi        | yes
  5 |    2 |    9440 | giooschi        | yes
  6 |    1 |    5527 | doge            | yes
  6 |    2 |   66803 | giooschi        | yes
  7 |    1 |    5413 | giooschi        | yes
  7 |    2 |    7516 | giooschi        | yes
  8 |    1 |     725 | alion02         | yes
  8 |    2 |    2146 | bendn           | yes
  9 |    1 |   15850 | alion02         | yes
  9 |    2 |   49969 | ameo            | yes
 10 |    1 |    3013 | giooschi        | yes
 10 |    2 |    4488 | _mwlsk          |  no*
 11 |    1 |      22 | giooschi        | yes
 11 |    2 |      19 | giooschi        | yes
 12 |    1 |   24238 | giooschi        | yes
 12 |    2 |   25721 | giooschi        | yes
 13 |    1 |    1902 | alion02         | yes
 13 |    2 |    2128 | goldsteinq      |  no*
 14 |    1 |    3540 | giooschi        | yes
 14 |    2 |    2072 | giooschi        | yes
 15 |    1 |   24386 | alion02         | yes
 15 |    2 |   34862 | alion02         | yes
 16 |    1 |   43778 | alion02         | yes
 16 |    2 |   56360 | giooschi        | yes
 17 |    1 |      12 | alion02         | yes
 17 |    2 |       1 | alion02         | yes
 18 |    1 |    2865 | alion02         | yes
 18 |    2 |   12838 | caavik          | yes
 19 |    1 |   12362 | giooschi        | yes
 19 |    2 |   18610 | giooschi        | yes
 20 |    1 |   16407 | giooschi        | yes
 20 |    2 |   47626 | giooschi        | yes
 21 |    1 |       3 | bendn/giooschi  | yes
 21 |    2 |       3 | bendn/giooschi  | yes
 22 |    1 |    6703 | giooschi        | yes
 22 |    2 |  423158 | caavik+giooschi | yes
 23 |    1 |   10031 | giooschi        | yes
 23 |    2 |    7357 | giooschi        | yes
 24 |    1 |    1830 | giooschi        | yes
 24 |    2 |    1436 | giooschi        | yes
 25 |    1 |    2335 | giooschi        | yes
---------------------------------------------------------
              988936ns

for a total of 989us!

For any entry where source available is no*, the next fastest solution is shown instead because the author of the fastest solution has not yet agreed to have their code displayed here.

Further Reading

IMO the best way to learn is to participate, which is why I highly encourage people to try to optimize AoC solutions themselves. It's a fantastic way to learn SIMD. If you decide to do so, absolutely join the Rust Programming Language Community discord server! It's a wonderful community with incredibly talented and knowledgeable folks who are happy to help you optimize. I've learned about topics like instruction pipelines, cache misses, and SIMD just by following the discussions there!

In-depth explanations of these topics would be super helpful. I hope to some day write those explanations myself.

In the meantime, if you would like a more in-depth explanation of some of the optimization techniques used, I highly recommend you check out this article by ameo (one of our participants). It covers the process they used to optimize their solution for Day 9 Part 2, and how they got it to the top of our leaderboard. The article provides incredible information on the process of both high-level and micro optimization:

Also check out the following:

Credits

  • Thank you to the members of the Rust Programming Language Community and Serenity-rs Discord servers and everyone else who participated in the challenge!
  • Thank you to Eric Wastl for hosting AoC every year!
  • Thank you to Noxim for writing the original version of our benchmark bot.
  • Extra special thank you to yuyuko, bend-n, and giooschi for their help in maintaining and improving our benchmark bot.

This repo contains code/optimizations from the following authors:

Thank you so much to these talented individuals for participating in AoC with us! I highly encourage you to checkout their repos for more details and examples. These people are incredibly talented at what they do.

This repo contains code submitted to the https://github.com/indiv0/ferris-elf bot, by multiple users. Code is only included in this repo if the authors have explicitly provided permission to post their code here, so some solutions may not be present.

Background

This year, some members of the Rust Programming Language Community Server on Discord set out to solve AoC in under 1ms. I'm pleased to announce that through the use of LUTs, SIMD, more-than-questionable unsafe, assertions, LLVM intrinsics, and even some inline ASM that goal has been reached!

If you are interested, join us in #advent-of-code-2024 on the Discord server for further discussion :)

Context/Caveats

  • All submissions were run on the same hardware (Ryzen 5950X) to ensure consistency, with the same compiler flags and features available. This was on rustc nightly (updated throughout the course of the contest), and with CPU speed capped at 3400 MHz with boost clock disabled.
  • AVX-512 was not available on the machine so none (?) of the solutions utilize that particular set of accelerated instructions, but there is plenty of other SIMD in use.
  • All submissions were run against the same inputs to ensure consistency.
  • Caching anything that has been fed with input was not allowed to prevent cheating and/or trivial solutions like Map<Input, Output>.
  • For the same reason, inputs were not directly available to the participants, and were not provided at compile-time.
  • Participants were allowed to use compile-time tricks in their answers. Due to limitations in the benchmark bot, the runtime of these optimizations could not be measured. This was considered acceptable as the compiled binaries were expected to otherwise work correctly for arbitrary inputs. This means that participants are allowed to use look-up tables (LUTs) in their answers, but those LUTs are expected to work for arbitrary inputs, not just specific ones.
  • I/O is trivial, and was thus not measured as part of the benchmark. That is, participants were provided with an &str or &[u8] input (their choice) and expected to provide an impl Display as part of their result. Therefore, input parsing was measured.