Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat(data-warehouse): V2 pipeline release #27732

Merged
merged 5 commits into from
Jan 22, 2025
Merged

Conversation

Gilbert09
Copy link
Member

Changes

  • Releases V2 pipeline to the masses!!!
  • Removed the old pipeline code, the V2 pipeline will run on the V1 workers going forward
    • Follow up PR incoming to reduce V1 worker size and to delete the V2 workers
  • Removed all references to V2 code
  • Added some code to handle missing fields from API sources (e.g. zendesk/stripe) with the correct types
    • e.g. converting datetimes from strings to actual python datetime objects
    • and populating missing fields with nulls with the correct pyarrows type
    • This is to keep compatibility with existing deltalake tables in S3
  • Set all new jobs to be billable - no longer conditional on the pipeline version

Does this work well for both Cloud and self-hosted?

Aye

How did you test this code?

  • test_end_to_end.py tests have been a life saver here, even the super simple zendesk/stripe tests !!!!!
  • some more local testing to be done to ensure compatibility with existing S3 deltatables, doing this now

@Gilbert09 Gilbert09 requested a review from a team January 21, 2025 15:19
Copy link
Member

@EDsCODE EDsCODE left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

huge

@@ -209,14 +157,6 @@ def parse_inputs(inputs: list[str]) -> ExternalDataWorkflowInputs:
async def run(self, inputs: ExternalDataWorkflowInputs):
assert inputs.external_data_schema_id is not None

if settings.TEMPORAL_TASK_QUEUE != DATA_WAREHOUSE_TASK_QUEUE_V2 and not TEST:
await workflow.execute_activity(
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As mentioned on standup, I believe with this, any workflow that gets resumed that has triggered this will end up in a bad state. (Should be minimal though so we can just reboot)

@Gilbert09 Gilbert09 merged commit 27f888c into master Jan 22, 2025
91 of 92 checks passed
@Gilbert09 Gilbert09 deleted the tom/release-v2-pipeline branch January 22, 2025 16:08
Gilbert09 added a commit that referenced this pull request Jan 22, 2025
@Gilbert09 Gilbert09 restored the tom/release-v2-pipeline branch January 22, 2025 16:40
fuziontech added a commit that referenced this pull request Jan 22, 2025
* master: (103 commits)
  feat(postgres-estimated-rows): pg Estimated Rows on Data Warehouse Sync (#27634)
  fix: revert darkmode class toggle, updated content on fills (#27783)
  chore: upgrade posthog-js (#27790)
  chore(editor-3001): add back join actions (#27740)
  feat: Add person distinct ID overrides squash job (as dagster job) (#27710)
  fix(created-by-sources): Adding `created_by` to sources (#27751)
  Revert "feat(data-warehouse): V2 pipeline release " (#27791)
  fix: typo for feature flags (#27786)
  fix(defer-unmounting): Defer unmounting of react elements (#27742)
  feat(data-warehouse): V2 pipeline release (#27732)
  fix(data-warehouse): Ensure dates are actual datetime formats (#27777)
  fix: enable hot reload for the products dir (#27746)
  fix: assignee selector when null (#27737)
  chore: clarify rrweb imports (#27776)
  chore(deps): Update posthog-js to 1.207.3 (#27779)
  feat(retention): filters on start/return event (#27770)
  fix(experiments): only show supported math functions (#27589)
  feat(web-analytics): Set unique conversions graph when adding conversions goal (#27774)
  chore: color design system part 1: banner and accents (#27756)
  chore(experiments): Add tests for funnel attribution options (#27752)
  ...
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants