feat(identify): implement signedPeerRecord #5785

drHuangMHT · 2024-12-31T10:13:08Z

Description

May close #4017.

Notes & open questions

pb-rs now uses Cow<'_,T> for the compiled Rust structs. But with borrowed type in the struct, FramedRead can no longer process frames correctly(trait bound not statisfied).

Change checklist

I have performed a self-review of my own code
I have made corresponding changes to the documentation
I have added tests that prove my fix is effective or that my feature works
A changelog entry has been made in the appropriate crates

drHuangMHT · 2024-12-31T10:15:45Z

Oh CI will compile the protobuf code, didn't see that.
EDIT: Wait how does this work?

dariusc93

Left some comments :)

protocols/identify/src/behaviour.rs

getong · 2025-01-03T01:55:04Z

you might rebase to main branch, as zlib license has added to ci， #5769

getong · 2025-01-03T03:01:21Z

Files in /home/runner/work/rust-libp2p/rust-libp2p/protocols/identify have changed, please write a changelog entry in /home/runner/work/rust-libp2p/rust-libp2p/protocols/identify/CHANGELOG.md

I think you might write some changelog text into this file, just like,

implement signedPeerRecord

the pr might be okay now.

drHuangMHT · 2025-01-03T03:05:49Z

the pr might be okay now.

Thanks for the review! Working on tests right now.

drHuangMHT · 2025-01-03T04:34:31Z

version 0.45.1 of libp2p-identify hasn't been released on crates.io yet, I'm not sure whether I should add a changelog entry right now.

elenaf9 · 2025-01-09T09:51:44Z

version 0.45.1 of libp2p-identify hasn't been released on crates.io yet, I'm not sure whether I should add a changelog entry right now.

CHANGELOG will be fixed with #5803, you can just add the new entry to 0.46.0.

elenaf9

Thank you @drHuangMHT! Looks like a great start.

I found a pending libp2p spec for signed peer records: libp2p/specs#630. Most of it matches this PR, just one thing is missing:

If the signedPeerRecord is present the implementation MUST use the data contained within it and ignore duplicated fields present in the main identify message

IMO that makes sense and should be added to this PR. So instead of copying the received record as it is in handle_incoming_info, I think we should instead (when an signed envelope is present):

Verify that the signature matches the remote's public key
Deserialize the record, store the resulting addresses as listen_addrs, and ignore the original listen_addrs.

Wdyt?

protocols/identify/src/behaviour.rs

elenaf9 · 2025-01-11T12:55:46Z

protocols/identify/src/protocol.rs

+        // When signedPeerRecord contains valid addresses, ignore addresses in listenAddrs.  
+        // When signedPeerRecord is invalid or signed by others, ignore the signedPeerRecord(set to `None`).  
+        let (signed_peer_record, listen_addrs) = signed_peer_record
+            .as_ref()
+            .and_then(|envelope| PeerRecord::try_deserialize_signed_envelope(&envelope).ok())
+            .and_then(|(envelope_public_key, _, _, addresses)| {
+                (*envelope_public_key == public_key).then_some(addresses)
+            })
+            .map(|addrs| (signed_peer_record, addrs))
+            .unwrap_or_else(|| (None, parse_listen_addrs(msg.listenAddrs)));


Do we really need to deserialize the record again? Can't we just read out PeerRecord::adddresses:

let listen_addrs = signed_peer_record.map_or_else( || parse_listen_addrs(msg.listenAddrs), |record| record.addresses.to_vec() );

Ahh we also need to check that the envelope key matches, but that can be done as well by reading the PeerRecord::peer_id and comparing it with the remote's id, right?

Do we really need to deserialize the record again? Can't we just read out PeerRecord::adddresses:

let listen_addrs = signed_peer_record.map_or_else( || parse_listen_addrs(msg.listenAddrs), |record| record.addresses.to_vec() );

record has type SignedEnvelope, whose payload is in the form of Vec<u8>(bytes) that doesn't have addresses field unless we deserialize the payload into PeerRecord. In order to own the addresses the only way is to call record.addresses().to_vec() which in this case allocates twice, which is unnecessary. So I extracted PeerRecord::try_deserialize_signed_envelope to cut down allocation.

Ahh we also need to check that the envelope key matches, but that can be done as well by reading the PeerRecord::peer_id and comparing it with the remote's id, right?

PeerId is derived from a public key, which indirectly proves its identity while the key itself does so directly. ID can collide, but the key is less likely, considering we support multiple key types. I believe it is safer to compare the keys directly when they are already present.

Sorry for the late reply!

record has type SignedEnvelope, whose payload is in the form of Vec<u8>(bytes) that doesn't have addresses field unless we deserialize the payload into PeerRecord.

Ah I see. The name signed_peer_record had me confused. How about calling it signed_envelope instead, so it matches the type (also in Info)?

In order to own the addresses the only way is to call record.addresses().to_vec() which in this case allocates twice, which is unnecessary. So I extracted PeerRecord::try_deserialize_signed_envelope to cut down allocation.

Good point. Still, I would expect that the compiler optimizes away the unnecessary allocation. IMO the current code is a bit difficult to understand, and for the sake of simplicity it would be worth it to do the conversion to PeerRecord so that we can then simply clone PeerRecord::addresses.
However, if you have a strong preference for the current implementation I am okay with it. But then I think PeerRecord::try_deserialize_signed_envelope logic better fits as SignedEnvelope::try_deserialize(&self) -> ... (same logic, just move to SignedEnvelope).

PeerId is derived from a public key, which indirectly proves its identity while the key itself does so directly. ID can collide, but the key is less likely, considering we support multiple key types. I believe it is safer to compare the keys directly when they are already present.

We rely on the collision resistance of PeerId throughout all of rust-libp2p, and go-libp2p also only compares IDs here. I'm not against comparing public keys, but given that PeerRecord::try_deserialize_signed_envelope already compares the PeerIds I don't think we need the double check here.

However, if you have a strong preference for the current implementation I am okay with it. But then I think PeerRecord::try_deserialize_signed_envelope logic better fits as SignedEnvelope::try_deserialize(&self) -> ... (same logic, just move to SignedEnvelope).

I don't think it is a good idea to implement deserialization on SignedEnvelope because it can contain arbitrary payload, including but not limited to PeerRecord. We need specific "headers" to properly deserialize the envelope, according to the spec.

We rely on the collision resistance of PeerId throughout all of rust-libp2p, and go-libp2p also only compares IDs here.

That's fair. Good to know.

I'm not against comparing public keys, but given that PeerRecord::try_deserialize_signed_envelope already compares the PeerIds I don't think we need the double check here.

The two checks are different. In try_deserialize_signed_envelope it checks whether the signer of the record matches the signer of the envelope. While in Info::try_from we essentially check the signer of the envelope(therefore signer of the address record) against the sender of the identify message. If we skip the check in try_from, the address can come from peers other than the message sender.

implement signedPeerRecord

2a09cdb

dariusc93 reviewed Dec 31, 2024

View reviewed changes

protocols/identify/src/behaviour.rs Outdated Show resolved Hide resolved

protocols/identify/src/behaviour.rs Outdated Show resolved Hide resolved

protocols/identify/src/behaviour.rs Outdated Show resolved Hide resolved

apply suggestions

b5d7aec

Merge branch 'master' into identify-peer-record

85d7496

add test

4d3d692

drHuangMHT added 2 commits January 4, 2025 13:23

reduce diff

59f80f2

Merge branch 'master' into identify-peer-record

ef89b5b

drHuangMHT marked this pull request as ready for review January 4, 2025 05:27

elenaf9 requested changes Jan 9, 2025

View reviewed changes

protocols/identify/src/behaviour.rs Outdated Show resolved Hide resolved

protocols/identify/src/behaviour.rs Outdated Show resolved Hide resolved

drHuangMHT added 4 commits January 10, 2025 12:26

rename symbols

12b9a29

prefer addresses in signedPeerRecord

d3d8ae6

rename symbols

59151b6

lint and fmt

9b6a139

elenaf9 reviewed Jan 11, 2025

View reviewed changes

Merge branch 'master' into identify-peer-record

cf30682

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(identify): implement signedPeerRecord #5785

feat(identify): implement signedPeerRecord #5785

drHuangMHT commented Dec 31, 2024 •

edited

Loading

drHuangMHT commented Dec 31, 2024 •

edited

Loading

dariusc93 left a comment

getong commented Jan 3, 2025

getong commented Jan 3, 2025

drHuangMHT commented Jan 3, 2025

drHuangMHT commented Jan 3, 2025

elenaf9 commented Jan 9, 2025

elenaf9 left a comment

elenaf9 Jan 11, 2025

elenaf9 Jan 11, 2025

drHuangMHT Jan 11, 2025

drHuangMHT Jan 12, 2025

elenaf9 Feb 8, 2025 •

edited

Loading

drHuangMHT Feb 8, 2025

feat(identify): implement signedPeerRecord #5785

Are you sure you want to change the base?

feat(identify): implement signedPeerRecord #5785

Conversation

drHuangMHT commented Dec 31, 2024 • edited Loading

Description

Notes & open questions

Change checklist

drHuangMHT commented Dec 31, 2024 • edited Loading

dariusc93 left a comment

Choose a reason for hiding this comment

getong commented Jan 3, 2025

getong commented Jan 3, 2025

drHuangMHT commented Jan 3, 2025

drHuangMHT commented Jan 3, 2025

elenaf9 commented Jan 9, 2025

elenaf9 left a comment

Choose a reason for hiding this comment

elenaf9 Jan 11, 2025

Choose a reason for hiding this comment

elenaf9 Jan 11, 2025

Choose a reason for hiding this comment

drHuangMHT Jan 11, 2025

Choose a reason for hiding this comment

drHuangMHT Jan 12, 2025

Choose a reason for hiding this comment

elenaf9 Feb 8, 2025 • edited Loading

Choose a reason for hiding this comment

drHuangMHT Feb 8, 2025

Choose a reason for hiding this comment

drHuangMHT commented Dec 31, 2024 •

edited

Loading

drHuangMHT commented Dec 31, 2024 •

edited

Loading

elenaf9 Feb 8, 2025 •

edited

Loading