Finish skeleton common-accessioning robot and workflow def for `speechToTextWF` #1341

jmartin-sul · 2024-09-12T19:35:30Z

The start of the code for this is already there, and assumes the name will be captionWF: https://github.com/sul-dlss/common-accessioning/tree/main/lib/robots/dor_repo/caption

But part of this work will be renaming in light of the decision to go with speechToTextWF. See terminology note on sul-dlss/speech-to-text#1

The workflow XML is not yet in place, but the skeleton code is already present.

The workflow definition would be a new XML file here: https://github.com/sul-dlss/workflow-server-rails/tree/main/config/workflows

The skeleton workflow with placeholders for steps we expect, with implementations filled in as supporting services are developed, will be similar to what we did for ocrWF.

Steps we're likely to have, in order:

writing media files to S3
(maybe) signaling that those files need to be transcribed (if simply writing them to the bucket isn't enough)
picking up a workflow step signaling that the transcription output has been placed in an S3 bucket
updating the cocina appropriately and accessioning the output files

Currently, there is placeholder workflow step code for starting the workflow, ending it, and generating speech-to-text output (this last part will very likely be broken into the multiple steps describe above).

The text was updated successfully, but these errors were encountered:

peetucket · 2024-09-18T20:13:51Z

Closed by #1342

This was referenced Sep 12, 2024

Finish skeleton common-accessioning robot and workflow def for... captionWF? speechToTextWF? [final name TBD] sul-dlss/speech-to-text#8

Closed

[EPIC] Prototype workflow for generating and accessioning speech-to-text extraction sul-dlss/speech-to-text#1

Open

jmartin-sul added the blocked prereqs for this ticket aren't done yet label Sep 12, 2024

peetucket removed the blocked prereqs for this ticket aren't done yet label Sep 13, 2024

peetucket changed the title ~~Finish skeleton common-accessioning robot and workflow def for... captionWF? speechToTextWF? [final name TBD]~~ Finish skeleton common-accessioning robot and workflow def for... speechToTextWF Sep 13, 2024

peetucket changed the title ~~Finish skeleton common-accessioning robot and workflow def for... speechToTextWF~~ Finish skeleton common-accessioning robot and workflow def for speechToTextWF Sep 13, 2024

peetucket self-assigned this Sep 13, 2024

This was referenced Sep 13, 2024

Starting version of speech to text robots #1342

Merged

create initial version of speechToText workflow sul-dlss/workflow-server-rails#804

Merged

peetucket closed this as completed Sep 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Finish skeleton common-accessioning robot and workflow def for `speechToTextWF` #1341

Finish skeleton common-accessioning robot and workflow def for `speechToTextWF` #1341

jmartin-sul commented Sep 12, 2024 •

edited

Loading

peetucket commented Sep 18, 2024

Finish skeleton common-accessioning robot and workflow def for speechToTextWF #1341

Finish skeleton common-accessioning robot and workflow def for speechToTextWF #1341

Comments

jmartin-sul commented Sep 12, 2024 • edited Loading

peetucket commented Sep 18, 2024

Finish skeleton common-accessioning robot and workflow def for `speechToTextWF` #1341

Finish skeleton common-accessioning robot and workflow def for `speechToTextWF` #1341

jmartin-sul commented Sep 12, 2024 •

edited

Loading