Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Finish skeleton common-accessioning robot and workflow def for speechToTextWF #1341

Closed
Tracked by #1
jmartin-sul opened this issue Sep 12, 2024 · 1 comment
Closed
Tracked by #1
Assignees

Comments

@jmartin-sul
Copy link
Member

jmartin-sul commented Sep 12, 2024

The start of the code for this is already there, and assumes the name will be captionWF: https://github.com/sul-dlss/common-accessioning/tree/main/lib/robots/dor_repo/caption

But part of this work will be renaming in light of the decision to go with speechToTextWF. See terminology note on sul-dlss/speech-to-text#1

The workflow XML is not yet in place, but the skeleton code is already present.

The workflow definition would be a new XML file here: https://github.com/sul-dlss/workflow-server-rails/tree/main/config/workflows

The skeleton workflow with placeholders for steps we expect, with implementations filled in as supporting services are developed, will be similar to what we did for ocrWF.

Steps we're likely to have, in order:

  • writing media files to S3
  • (maybe) signaling that those files need to be transcribed (if simply writing them to the bucket isn't enough)
  • picking up a workflow step signaling that the transcription output has been placed in an S3 bucket
  • updating the cocina appropriately and accessioning the output files

Currently, there is placeholder workflow step code for starting the workflow, ending it, and generating speech-to-text output (this last part will very likely be broken into the multiple steps describe above).

@jmartin-sul jmartin-sul added the blocked prereqs for this ticket aren't done yet label Sep 12, 2024
@peetucket peetucket removed the blocked prereqs for this ticket aren't done yet label Sep 13, 2024
@peetucket peetucket changed the title Finish skeleton common-accessioning robot and workflow def for... captionWF? speechToTextWF? [final name TBD] Finish skeleton common-accessioning robot and workflow def for... speechToTextWF Sep 13, 2024
@peetucket peetucket changed the title Finish skeleton common-accessioning robot and workflow def for... speechToTextWF Finish skeleton common-accessioning robot and workflow def for speechToTextWF Sep 13, 2024
@peetucket peetucket self-assigned this Sep 13, 2024
@peetucket
Copy link
Member

Closed by #1342

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants