feat(debug): Log a privacy preserving hash of IP and UserAgent to assist in rate limiting debugging #148

ryanschneider · 2024-07-10T22:12:55Z

📝 Summary

We are seeing abuse of our endpoint that bypasses our firewall rules. To help determine how this traffic gets past the firewall, we need to log its "source" in a privacy-preserving way.

~~What I came up with is the xxhash( x-forwarded-for + user-agent ) or more specifically the left-most non-private IP in the X-Forwarded-For and the User-Agent headers.~~

However, after discussing w/ @0x416e746f6e, he made the good point that User-Agent can be gamed by randomly generating a new UA w/ each request. Furthermore, the fingerprint should not be long-term stable, as that would allow a malicious actor with access to Flashbots logs to track user behavior long-term. So now the hash is salted w/ the current timestamp truncated to the hour, and the UA is removed. Finally, the truncated timestamp is XOR'ed w/ a random uint64 generated at startup to prevent "rainbow table" attacks, where a malicious RPC operator exhaustively hashes all IP/timestamp combinations to determine the source IP of a fingerprint.
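For reviewers who want the scheme in one place, here is a minimal sketch of the idea described above. It is not the code in this PR: the names `Fingerprint`, `newSeed`, `hourlySalt`, `firstPublicIP`, and `fingerprintFor` are hypothetical, and FNV-1a from the standard library stands in for xxhash so the example has no external dependency.

```go
package main

import (
	"crypto/rand"
	"encoding/binary"
	"hash/fnv"
	"net"
	"strings"
	"time"
)

// Fingerprint is a short-lived identifier for a traffic source (hypothetical type).
type Fingerprint uint64

// newSeed draws the random uint64 that is generated once at startup.
func newSeed() (uint64, error) {
	var b [8]byte
	if _, err := rand.Read(b[:]); err != nil {
		return 0, err
	}
	return binary.BigEndian.Uint64(b[:]), nil
}

// hourlySalt truncates the timestamp to the hour and XORs it with the seed,
// so the salt changes every hour and cannot be guessed from the time alone.
func hourlySalt(now time.Time, seed uint64) uint64 {
	return uint64(now.Truncate(time.Hour).Unix()) ^ seed
}

// firstPublicIP returns the left-most non-private address in an
// X-Forwarded-For value, or nil if there is none.
func firstPublicIP(xff string) net.IP {
	for _, part := range strings.Split(xff, ",") {
		ip := net.ParseIP(strings.TrimSpace(part))
		if ip == nil || ip.IsPrivate() || ip.IsLoopback() || ip.IsUnspecified() {
			continue
		}
		return ip
	}
	return nil
}

// fingerprintFor hashes the salt followed by the 16-byte form of the IP.
// ASSUMPTION: FNV-1a is used here only as a stand-in for xxhash.
func fingerprintFor(ip net.IP, salt uint64) Fingerprint {
	var saltBytes [8]byte
	binary.BigEndian.PutUint64(saltBytes[:], salt)

	h := fnv.New64a()
	h.Write(saltBytes[:])
	h.Write(ip.To16())
	return Fingerprint(h.Sum64())
}
```

With this shape, two requests from the same IP within the same hour and process lifetime share a fingerprint, while a restart or the next hour yields an unrelated value.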

In addition, we currently use proxyd as our upstream, which by default uses the X-Forwarded-For header for rate limiting. Since the xxhash output is a uint64, it fits easily in an IPv6 address, so we convert the fingerprint to a fake IPv6 address in the reserved documentation ("example address") prefix from https://datatracker.ietf.org/doc/html/rfc3849 and insert this "IP" as the X-Forwarded-For header.
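A sketch of the conversion, assuming the 64-bit fingerprint is placed in the low 8 bytes of an address under `2001:db8::/32` (the RFC 3849 prefix). The function name `toDocumentationIPv6` and the exact byte layout are illustrative assumptions, not necessarily what this PR does.

```go
package main

import (
	"encoding/binary"
	"net"
)

// toDocumentationIPv6 embeds a 64-bit fingerprint in an address under the
// RFC 3849 documentation prefix 2001:db8::/32.
// ASSUMPTION: the fingerprint occupies the low 8 bytes; bytes 4-7 stay zero.
func toDocumentationIPv6(fp uint64) net.IP {
	addr := make(net.IP, net.IPv6len)
	addr[0], addr[1], addr[2], addr[3] = 0x20, 0x01, 0x0d, 0xb8
	binary.BigEndian.PutUint64(addr[8:], fp)
	return addr
}
```

The resulting address can then be forwarded upstream with something like `req.Header.Set("X-Forwarded-For", toDocumentationIPv6(fp).String())`, so proxyd rate limits per fingerprint without ever seeing the real IP.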

⛱ Motivation and Context

Give us the means to perform log aggregation to see whether the offending traffic is coming from a single source or multiple sources.

📚 References

✅ I have run these commands

make lint
make test
go mod tidy

server/fingerprint.go

metachris

Code looks great. Left a few minor comments, but nothing important.

ryanschneider added 6 commits July 9, 2024 08:12

chore(TEST): Log X-Forwarded-For (will deploy to staging for testing)

77b26bf

fingerprint hash of XFF and UA

0d7748c

Proper fingerprint implementation

a5cdd7e

Remove logging of PII (IP and UA)

fb14167

fixup: fix lint

5fcd5d6

Move Fingerprint to its own file, and basic unit test.

426ad10

ryanschneider marked this pull request as ready for review July 11, 2024 22:39

ryanschneider requested review from dvush, TymKh and metachris as code owners July 11, 2024 22:39

metachris reviewed Jul 23, 2024

server/fingerprint.go (outdated)

metachris reviewed Jul 23, 2024

server/fingerprint.go

metachris reviewed Jul 23, 2024

server/fingerprint.go (outdated)

metachris approved these changes Jul 23, 2024

ryanschneider added 2 commits July 23, 2024 15:00

Address PR feedback.

eb3f214

Add seed to prevent exhaustive IP lookup

855a64f

metachris approved these changes Jul 24, 2024

ryanschneider merged commit ebf1086 into main Jul 24, 2024
2 checks passed

ryanschneider deleted the log-x-forwarded branch July 24, 2024 15:02
