The EAD AMI Ingest Plugin #195
Labels
AMI Import Plugins
Gets data from somewhere, puts data into a strawberry basked
CSV Processing
Things we do here and there to keep the tabulated goddesses happy
enhancement
New feature or request
File processing
Everything is a file, even me.
Ingest Setup
Knobs and Levers you move while thinking about feelings and metadata and CSV files
queue workers
Ones taking the FI and doing the FO
queue
FIFO
Milestone
What?
EADs are complex/fixed realities (XML) with sometimes thousands of containers. So far (@alliomeria is this true?) or approach of making each Container an ADO connected to its Top One + a connection to its parent one, has show potential and good concern separation. But that also implies, a batch set might imply 100s of thousands of rows. So ... To avoid User errors and since the data modeling opportunities here could overwhelm our community (and you all already have a generic CSV importer) the task at hand here is
Create an opinionated (like me) but kind (like you) Plugin that takes
A)
a CSV generated by another well formed script (you run this on your local) or (pass 2 of this issue)B)
a ZIP file with XML files. (yes a ZIP, not remote, not anything).For the former, You can have nested CSVs. The main CSV can contain in each ROW a column referencing another CSV (which needs to be provided as a file either in S3:// or inside a ZIP:// our an URL:// (same as you would with e.g an Image attached to an ADO).
A mapping to select which template to run for Rows, which template to run for the per-row CSV rows. (so max 2 templates)
A mapping to select which bundle to run for the main + for the rows. By default, if our custom Drupal Content type was loaded/created/present (will ship with a next release) no selection will be available (same bundle)
Types are fixed (so much control!)
A new Queue worker which has as only purpose to read a CSV and push future items into the queue. This new Queue worker can/will be also available for the other plugins, allowing (new feature) to avoid waiting while the "form" processes each CSV row, basically delegating that very action to a queue item. For the EAD itself, this will basically push into the queue after every EAD ADO queue item, this one for the Child CSV generating this pseudo structure
The text was updated successfully, but these errors were encountered: