[RFC] Hard fork data migration #14288

psteckler · 2023-10-06T19:57:18Z

RFC on getting data from the mainnet archive db to a db using the berkeley schema.

The branch feature/berkeley-db-migrator is used in #12906.

The branch feature/add-berkeley-account-tables is used in #14339.

Part of #12676.

psteckler · 2023-10-06T19:57:31Z

!ci-build-me

ghost-not-in-the-shell · 2023-10-06T20:00:23Z

rfcs/0052-hard-fork-data-migration.md

@@ -0,0 +1,141 @@
+p## Summary


You may want to delete the p here

psteckler · 2023-10-06T20:01:47Z

!ci-build-me

ghost-not-in-the-shell · 2023-10-06T20:04:20Z

rfcs/0052-hard-fork-data-migration.md

+These applications can be run in sequence to get a fully-migrated
+database. They should be able to work incrementally, so that part of
+the mainnet database can be migrated and, as new blocks are added on
+mainnet, the new data in the databannnnnse can be migrated.


Whoops, again.

kaozenn · 2023-10-10T05:07:59Z

After the HF (Hard Fork), will the "bucket for storing migrated database dumps" become the new reference bucket for retrieving Archive DB Dumps?
Post HF, could we designate an Archive Dump as a reference for data retrieval? Applications such as Rosetta would not only need to be refactored to read from the new schema but also to pull data from the new S3 Bucket.

psteckler · 2023-10-10T05:25:59Z

> After the HF (Hard Fork), will the "bucket for storing migrated database dumps" become the new reference bucket for retrieving Archive DB Dumps?

I would expect there will be an archive dump cron job for the new mainnet, which will write to the existing mina-archive-dumps bucket (or maybe some new one). It won't be the bucket described here.

> Post HF, could we designate an Archive Dump as a reference for data retrieval? Applications such as Rosetta would not only need to be refactored to read from the new schema but also to pull data from the new S3 Bucket.

The current mainnet archive dumps are named mainnet-archive-dump-<DATE>_nnnn.sql. That naming convention could continue, or a new name could be chosen. As mentioned, the same bucket could continue to be used, or a new one created, if desired.

The current bucket is in Google Cloud Storage, not S3, which is an Amazon product.

kantp

Looks good to me.

kantp · 2023-10-20T11:28:27Z

rfcs/0052-hard-fork-data-migration.md

+
+How do we limit the migration to the final block of mainnet? There could be
+flags to the migration apps to stop at a given state hash or height.
+


I would suggest ending migration at slot_tx_end in RFC 51, #14138. We'll only have empty blocks for fork resolution after that anyway.

psteckler · 2023-10-20

OK, in the first-phase migration app, in #12906, I've added a --end-global-slot command-line arg, and tested that. The second-phase app doesn't need to worry about the end slot, because it can only process what the first-phase app has produced.

If you omit that arg, the app will migrate only canonical blocks. My understanding is that there may be some pending blocks to be migrated, so if you do provide that arg, the app will migrate both pending and canonical (but not orphaned) blocks.
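
For readers skimming the thread, the selection rule described here can be sketched roughly as below. This is only an illustration, not the code in #12906: the Block type, its field names, the select_blocks_to_migrate function, and the choice to treat the end slot as inclusive are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional


@dataclass(frozen=True)
class Block:
    # Hypothetical, simplified stand-in for a row in the archive blocks table.
    state_hash: str
    global_slot: int
    chain_status: str  # "canonical", "pending", or "orphaned"


def select_blocks_to_migrate(
    blocks: Iterable[Block], end_global_slot: Optional[int] = None
) -> List[Block]:
    """Pick the blocks to migrate, following the rule described in the comment."""
    if end_global_slot is None:
        # No end slot given: migrate canonical blocks only.
        return [b for b in blocks if b.chain_status == "canonical"]
    # End slot given: migrate canonical and pending blocks, never orphaned ones.
    # Treating the end slot as an inclusive bound is an assumption of this sketch.
    return [
        b
        for b in blocks
        if b.chain_status in ("canonical", "pending")
        and b.global_slot <= end_global_slot
    ]
```

With no end slot, a pending block is skipped; with one, it is included as long as its slot does not exceed the bound, and orphaned blocks are skipped either way.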

I removed the unresolved question, because I think this command-line arg solves the problem.

deepthiskumar · 2023-11-02T01:51:18Z

!ci-build-me

deepthiskumar · 2023-11-02T01:51:35Z

!approved-for-mainnet

It's me, CI and others added 2 commits October 6, 2023 12:56

[RFC] Hard fork data migration

4a1db2b

Merge branch 'berkeley' into rfc/hard-fork-data-migration

eb5cb53

ghost-not-in-the-shell reviewed Oct 6, 2023

remove stray character

4edc6c4

ghost-not-in-the-shell reviewed Oct 6, 2023

It's me, CI and others added 4 commits October 6, 2023 13:07

remove more stray characters

89f88e9

Merge branch 'berkeley' into rfc/hard-fork-data-migration

642ca05

Merge branch 'berkeley' into rfc/hard-fork-data-migration

d2f9ae7

mention existing cron jobs

df636c5

deepthiskumar mentioned this pull request Oct 10, 2023

Implement archive data migration tooling #14313

Closed

Merge branch 'berkeley' into rfc/hard-fork-data-migration

cc23c2d

psteckler mentioned this pull request Oct 12, 2023

App to add berkeley account tables #14339

Merged

Merge branch 'berkeley' into rfc/hard-fork-data-migration

23896b6

kantp approved these changes Oct 20, 2023

psteckler and others added 2 commits October 20, 2023 14:39

Merge branch 'berkeley' into rfc/hard-fork-data-migration

da287f8

Remove unresolved question about ending migration

e9b717d

dkijania mentioned this pull request Oct 25, 2023

Test/deploy Archive node migration tool #14338

Closed

psteckler and others added 3 commits October 31, 2023 15:37

Merge branch 'berkeley' into rfc/hard-fork-data-migration

36a408a

Merge branch 'berkeley' into rfc/hard-fork-data-migration

b5f432a

Merge branch 'berkeley' into rfc/hard-fork-data-migration

82207d7

psteckler merged commit 2d815c9 into berkeley Nov 2, 2023
1 check passed

psteckler deleted the rfc/hard-fork-data-migration branch November 2, 2023 01:54
