Skip to content

Pipeline to simulate, process, and model discogs rec user feedback using AWS Lambda, S3, Snowflake, and Airflow.

Notifications You must be signed in to change notification settings

justinpakzad/discogs-rec-pipeline

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Discogs Rec Pipeline

The Discogs Rec Pipeline simulates synthetic user feedback data based on recommendations from the discogs-rec app. Utilizing AWS Lambda, S3, Snowflake, and Airflow, the pipeline handles data generation, processing, and modeling to create a workflow for managing and analyzing user interaction data.

Data Flow Diagram

Alt text

Repository Structure

  • airflow/: Contains code related to Airflow.
  • src/: Core project files, divided into:
    • lambdas/: AWS Lambda functions for data generation and transformation.
    • scripts/: Python scripts for transformations and utility functions.
    • sql/: SQL scripts for table creation, loading, and transformations.
    • analytics: Queries for analyzing user behaviour and recommendation feedback.

Data Model

Alt text

About

Pipeline to simulate, process, and model discogs rec user feedback using AWS Lambda, S3, Snowflake, and Airflow.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published