docs: 🏗️ move `.puml` into pseudocode #1021

lwjohnst86 · 2025-01-29T09:19:00Z

Description

This PR moves the PlantUML diagram over into pseudocode and also adds a basic Mermaid diagram of the input and output flow.

This PR needs an in-depth review.

Checklist

Updated documentation
Ran `just run-all`

lwjohnst86 · 2025-01-29T09:22:53Z

docs/design/interface/pseudocode/write_resource_data_to_raw.py

+
+    - Can it, at a minimum, be read without problems or warnings?
+    - Do the columns in the data file match those in the properties?
+    - Do the data types in the data file match those in the properties?


These were the initial ones I was thinking about, but I guess as we use it in examples and real-world data, we could add more. Could even eventually move this function out into the checks package.
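To make the column check above concrete, here is a minimal sketch. It assumes a CSV input and Frictionless-style properties passed as a plain dict (`schema.fields[].name`); the body of `check_data_basics()` is a guess at what the pseudocode could become, not the agreed design, and the data type check is left out.

```python
import csv
from pathlib import Path


def check_data_basics(data_path: Path, resource_properties: dict) -> Path:
    """Hypothetical sketch: check that a CSV header matches the field names."""
    # Assumption: properties look like {"schema": {"fields": [{"name": ...}]}}.
    expected = [field["name"] for field in resource_properties["schema"]["fields"]]
    with open(data_path, newline="") as file:
        # Reading the header doubles as a very minimal "can it be read" check.
        header = next(csv.reader(file), [])
    if header != expected:
        raise ValueError(f"Columns {header} do not match properties {expected}.")
    return data_path
```

Returning the path keeps the check easy to chain with the other `check_*()` calls in the pseudocode.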

docs/design/interface/python-functions.qmd

martonvago

Makes sense in general, just added some comments!

martonvago · 2025-01-29T12:01:44Z

docs/design/interface/pseudocode/write_resource_data_to_raw.py

+    Copy the file from `data_path` over into the resource location given by
+    `path`. This will compress the file and use a timestamped, unique file
+    name to store it as a backup. See the
+    [design](https://sprout.seedcase-project.org/docs/design/) docs for an
+    explanation of this file. Use `path_resource_raw()` to provide the
+    correct `path` location. Copies and compresses the file, and outputs the
+    path object of the created file.


Maybe include that the data is checked against the metadata?

lwjohnst86 · 2025-01-31T08:24:51Z

docs/design/interface/pseudocode/write_resource_data_to_raw.py

@@ -0,0 +1,116 @@
+# ruff: noqa
+def write_resource_data_to_raw(data_path, resource_properties) -> Path:


I removed the `path` argument so we can use the path from the properties instead. One thing we need to consider is where this function will run. We either need to figure out a way to give an absolute path, or restrict this function to only running in a directory that has a `datapackage.json` (so it knows where the root is). Or, we have a function that seeks out the root of the package if this is run from a subfolder.

That's a good point! Allowing it to run in the root folder or any subfolder of that seems okay to me.
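A rough sketch of the "seek out the root" option, assuming the root is simply the nearest folder (the starting one or any parent) that contains a `datapackage.json`. The name `find_package_root()` is hypothetical and not part of the PR.

```python
from pathlib import Path
from typing import Optional


def find_package_root(start: Optional[Path] = None) -> Path:
    """Hypothetical helper: walk upwards until a `datapackage.json` is found."""
    # Assumption: `start` is a folder; defaults to the current working directory.
    current = (start or Path.cwd()).resolve()
    for folder in [current, *current.parents]:
        if (folder / "datapackage.json").is_file():
            return folder
    raise FileNotFoundError(
        f"No `datapackage.json` found in {current} or any parent folder."
    )
```

This would let the function run from the root or any subfolder, and fail with a clear error everywhere else.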

martonvago · 2025-01-31T09:08:28Z

docs/design/interface/pseudocode/write_resource_data_to_raw.py

+    check_is_supported_format(data_path)
+    check_data_basics(data_path, resource_properties)
+    check_data_constraints(data_path, resource_properties)
+    raw_dir = Path(resource_properties.path / "raw")


I think `path` is `resources/id/data.parquet`, but this is a minor point.
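If `path` does point at the data file, the raw folder would sit beside it, not below it. A minimal sketch of that, combined with the compressed, timestamped copy from the docstring; the function name `copy_to_raw()`, the gzip format, and the file naming scheme are all assumptions for illustration.

```python
import gzip
import shutil
from datetime import datetime, timezone
from pathlib import Path
from uuid import uuid4


def copy_to_raw(data_path: Path, resource_path: Path) -> Path:
    """Hypothetical sketch: store a compressed, timestamped copy in `raw/`."""
    # Assumption: `resource_path` looks like `resources/1/data.parquet`,
    # so `raw/` is a sibling of the data file.
    raw_dir = resource_path.parent / "raw"
    raw_dir.mkdir(parents=True, exist_ok=True)
    timestamp = datetime.now(timezone.utc).strftime("%Y-%m-%dT%H%M%SZ")
    # Assumption: timestamp plus a short random suffix keeps names unique.
    raw_file = raw_dir / f"{timestamp}-{uuid4().hex[:8]}{data_path.suffix}.gz"
    with open(data_path, "rb") as source, gzip.open(raw_file, "wb") as target:
        shutil.copyfileobj(source, target)
    return raw_file
```

With `resource_path = Path("resources/1/data.parquet")`, the copy ends up under `resources/1/raw/`.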

lwjohnst86 added 3 commits January 29, 2025 10:01

docs: 🏗️ move .puml diagram into pseudocode

6f75fcf

docs: 🏗️ move description into pseudocode, plus try Mermaid

bc61f02

docs: 🐛 need to use features specific to Mermaid version

7f7b090

lwjohnst86 requested a review from a team as a code owner January 29, 2025 09:19

github-actions bot assigned lwjohnst86 Jan 29, 2025

lwjohnst86 commented Jan 29, 2025

martonvago requested changes Jan 29, 2025

lwjohnst86 and others added 3 commits January 31, 2025 09:12

docs: ✏️ small typos from review

2c84bd2

Co-authored-by: martonvago

docs: 📝 clarify about checking metadata for raw data

7c143b9

docs: 🏗️ remove path argument, get it from properties

af19134

lwjohnst86 commented Jan 31, 2025

docs: 🏗️ remove path from Mermaid diagram

414606a

lwjohnst86 requested a review from martonvago January 31, 2025 08:37

martonvago approved these changes Jan 31, 2025
