lib: actually hash all 16 bytes of IPv6 addresses, not just 4 #17901

eqvinox · 2025-01-22T11:18:19Z

Core change: We were hashing 4 bytes of the address. Even for IPv6 addresses. Oops.

Related changes:

went through all initializations and assignments of gate src and rmap_src to check that we're not feeding garbage into the 12 unused bytes and hash that
- it should be noted that this was already the behavior before 2019 - all 16 bytes were unconditionally fed into the hash. This PR goes back to that behavior, but it is possible we might have accumulated places where we get junk data into the 12 padding bytes. Hence the cleanup patches in this PR.
- rmap_src use is just… wrong… this does a best-effort fixup but it's only a bandaid.
since the original change with the 4 bytes was done to "speed up things" I've gone and refactored nexthop_hash as a whole. It just feeds a contiguous block of struct nexthop into jhash now, this should perform better than the jhash_word shenanigans we've gone to before.
topotest contributed by Donald, Thanks! 😃

this PR is incomplete, nhg_compare_nexthops needs to descend into ->resolved (cf. #16970, but that PR is also incomplete)

Unfortunately the test should only go red if both the compare and the hash are broken at the same time, so I'd prefer to also fix NHG comparison to see if fixing that (and having the hash broken) makes the test go green to validate that. But fixing the compare needs a bit more work (I'll keep updating this PR)

While the loop is currently exited in all cases after using nexthop, it is a footgun to have "nh" around to be reused in another iteration of the loop. This would leave nexthop with partial data from the previous use. Make it local where needed instead. Signed-off-by: David Lamparter <[email protected]>

Zero out the 12 unused bytes (for the IPv6 address) when reading in an IPv4 address. Signed-off-by: David Lamparter <[email protected]>

Doesn't seem to break anything but really poor style to pass potentially uninitialized data to hash_lookup. Signed-off-by: David Lamparter <[email protected]>

mjstapp · 2025-01-22T12:41:04Z

bgpd/bgp_pbr.c

@@ -2624,7 +2624,6 @@ static void bgp_pbr_policyroute_add_to_zebra(struct bgp *bgp,
 static void bgp_pbr_handle_entry(struct bgp *bgp, struct bgp_path_info *path,
 				 struct bgp_pbr_entry_main *api, bool add)
 {
-	struct nexthop nh;


hmm - could we just initialize this with = {}; instead of adding multiple versions in inner scopes?

That doesn't fix the footgun mentioned in the commit description: this is used in a loop below, and while currently all paths exit the loop after using the nh variable if that doesn't remain the case we can end up with leftover garbage from a previous iteration of the loop.

It not being initialized isn't the thing being addressed here, in fact it's being cleared with memset twice in the current code (for good measure, I guess?)

mjstapp · 2025-01-22T13:05:22Z

lib/nexthop.h

 	 */
+	char _hash_begin[0];


hmm - I get that there's an assert, but ... could we just fix the core problem, that we're not hashing enough v6 octets? if someone could measure a meaningful improvement to nexthop hashing - at the level of performance that we offer - then there could be a separate improvement PR that reorganized the code struct?

My perspective is that the bug we're dealing with is best described as "we weren't hashing the things we thought we were hashing". While it's also intended to perform well by doing it in one block — the doing it in one block in itself also makes it clearer what exactly is being hashed?

But I'm not terribly set on this

rmap_src wasn't initialized, so for IPv4 the unused 12 bytes would contain whatever junk is on the stack on function entry. Also move the IPv4 parse before the IPv6 parse so if it's successful we can be sure the other bytes haven't been touched. Signed-off-by: David Lamparter <[email protected]>

When reading in a nexthop from ZAPI, only set the fields that actually have meaning. While it shouldn't happen to begin with, we can otherwise carry padding garbage into the unused leftover union bytes. Signed-off-by: David Lamparter <[email protected]>

We were hashing 4 bytes of the address. Even for IPv6 addresses. Oops. The reason this was done was to try to make it faster, but made a complex maze out of everything. Time for a refactor. Signed-off-by: David Lamparter <[email protected]>

We've accumulated a pair of rather fundamental mixups regarding nexthop hashing and recursive nexthops. Add kind of a "canary" test to make sure the basics are in place -- if this breaks, something has seriously gone wrong. Signed-off-by: Donald Sharp <[email protected]>

eqvinox added 3 commits January 22, 2025 11:13

fpm: guard against garbage in unused address bytes

7b0b7ee

Zero out the 12 unused bytes (for the IPv6 address) when reading in an IPv4 address. Signed-off-by: David Lamparter <[email protected]>

pbrd: initialize structs used in hash_lookup

524d0d3

Doesn't seem to break anything but really poor style to pass potentially uninitialized data to hash_lookup. Signed-off-by: David Lamparter <[email protected]>

eqvinox requested review from donaldsharp and mjstapp January 22, 2025 11:18

frrbot bot added bgp fpm libfrr pbr zebra labels Jan 22, 2025

github-actions bot added master size/L labels Jan 22, 2025

piotrsuchy mentioned this pull request Jan 22, 2025

lib: fix for incorrect nexthop lookup for recursive routes #16970

Open

mjstapp reviewed Jan 22, 2025

View reviewed changes

eqvinox and others added 4 commits January 23, 2025 15:27

lib: clean up nexthop hashing mess

97f3bfe

We were hashing 4 bytes of the address. Even for IPv6 addresses. Oops. The reason this was done was to try to make it faster, but made a complex maze out of everything. Time for a refactor. Signed-off-by: David Lamparter <[email protected]>

eqvinox force-pushed the nexthop_hashing branch from fb87a20 to 05b7942 Compare January 23, 2025 14:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lib: actually hash all 16 bytes of IPv6 addresses, not just 4 #17901

lib: actually hash all 16 bytes of IPv6 addresses, not just 4 #17901

eqvinox commented Jan 22, 2025

mjstapp Jan 22, 2025

eqvinox Jan 22, 2025

mjstapp Jan 22, 2025

eqvinox Jan 22, 2025

lib: actually hash all 16 bytes of IPv6 addresses, not just 4 #17901

Are you sure you want to change the base?

lib: actually hash all 16 bytes of IPv6 addresses, not just 4 #17901

Conversation

eqvinox commented Jan 22, 2025

mjstapp Jan 22, 2025

Choose a reason for hiding this comment

eqvinox Jan 22, 2025

Choose a reason for hiding this comment

mjstapp Jan 22, 2025

Choose a reason for hiding this comment

eqvinox Jan 22, 2025

Choose a reason for hiding this comment