[Standard] KaaS default StorageClass v2 (scs-0211-v2) #658
Conversation
Signed-off-by: Martin Morgenstern <[email protected]>
Signed-off-by: Martin Morgenstern <[email protected]>
LGTM, except...
Co-authored-by: Joshua Mühlfort <[email protected]> Signed-off-by: Martin Morgenstern <[email protected]>
Please add something about previous versions, as in https://docs.scs.community/standards/scs-0100-v3-flavor-naming
-- first, one sentence in the intro about this version, and then one section further below about previous versions and what has changed compared to those.
Signed-off-by: Martin Morgenstern <[email protected]>
Signed-off-by: Martin Morgenstern <[email protected]>
> Previously, the backing storage of the default storage class was required to be protected
> against data loss caused by a physical disk or host failure.
> It also contained recommendations (MAY) with regard to redundant storage across hosts
> or availability zones.
> In this revision, these requirements and recommendations have been dropped.
This clarification actually confused me a bit, as I was under the impression that protection "against data loss caused by a physical disk or host failure" was intended to be covered by "MUST NOT be backed by local or ephemeral storage".
Regarding edge setups, you concluded in #652:
- the requirement of "not being backed by local storage" should remain
  - there was a quick discussion whether this is too strict from a standards point of view
  - example: edge clouds probably cannot fulfill this -> but this is a very special case and might be solved by another hypothetical certificate scope for edge environments
There are not that many options (or use cases) for implementing storage that is NOT backed by local storage, yet NOT protected against data loss due to disk/host failure.
So this concession to CSPs will probably create little value for CSPs, yet it would reduce the value for users who may want to rely on SCS standards as "sensible/common defaults". I do not know of any major block storage cloud offering that does not provide basic redundancy or at least snapshotting.
> This clarification actually confused me a bit, as I was under the impression that protection "against data loss caused by a physical disk or host failure" was intended to be covered by "MUST NOT be backed by local or ephemeral storage".
I removed the "physical disk or host failure" sentence because we will not be able to test this as part of the conformance checks anyway.
My reasoning behind this is: even if you use the Cinder CSI or, let's say, the Rook CSI provisioner, the backing storage could in theory be misconfigured with respect to single-host or disk failure tolerance, but there is, AFAIK, no reliable way to check this from inside the K8s cluster (and with minimal privileges).
Of course, our expectation would be that this failure tolerance is configured correctly by the CSP. As a compromise I can add a sentence that documents this expectation.
In my understanding, the important bit of this standard is that a provisioned volume isn't bound to the node's lifecycle, i.e., it doesn't just use something like the rancher.io/local-path or the lvm.csi.metal-stack.io provisioner.
For the test, I currently plan to check against an allowlist of provisioners (e.g., cinder.csi.openstack.org, *.rbd.csi.ceph.com, driver.longhorn.io; may need extension in the future), because I think it is the most pragmatic approach. (Going one step further, the test could even try to mount the same PVC on two different nodes.)
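A minimal sketch of what such an allowlist check could look like, assuming the official kubernetes Python client and a kubeconfig with permission to list StorageClasses; the provisioner names are the ones mentioned above, while the function name and exit-code handling are hypothetical and not the actual conformance test:

```python
"""Hypothetical sketch: verify the default StorageClass uses an allowed provisioner.

Not the actual SCS conformance test; assumes the official kubernetes
Python client and a kubeconfig with permission to list StorageClasses.
"""
import fnmatch
import sys

from kubernetes import client, config

# Provisioner allowlist taken from the discussion above; a real test
# would likely need to extend this list over time.
ALLOWED_PROVISIONERS = (
    "cinder.csi.openstack.org",
    "*.rbd.csi.ceph.com",
    "driver.longhorn.io",
)

DEFAULT_ANNOTATION = "storageclass.kubernetes.io/is-default-class"


def check_default_storage_class() -> int:
    """Return 0 if the default StorageClass uses an allowed provisioner, else 1."""
    config.load_kube_config()
    for sc in client.StorageV1Api().list_storage_class().items:
        annotations = sc.metadata.annotations or {}
        if annotations.get(DEFAULT_ANNOTATION) != "true":
            continue
        # fnmatch handles both exact names and wildcard patterns like *.rbd.csi.ceph.com
        if any(fnmatch.fnmatch(sc.provisioner, pattern) for pattern in ALLOWED_PROVISIONERS):
            print(f"OK: default StorageClass '{sc.metadata.name}' uses provisioner '{sc.provisioner}'")
            return 0
        print(f"FAIL: provisioner '{sc.provisioner}' of default StorageClass '{sc.metadata.name}' is not in the allowlist")
        return 1
    print("FAIL: no default StorageClass found")
    return 1


if __name__ == "__main__":
    sys.exit(check_default_storage_class())
```

The stricter variant mentioned above (mounting the same PVC on two different nodes) would additionally need to create and clean up test pods and wait for volume attachment, so it is not part of this sketch.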
> So this concession to CSPs will probably create little value for CSPs, yet it would reduce the value for users who may want to rely on SCS standards as "sensible/common defaults". I do not know of any major block storage cloud offering that does not provide basic redundancy or at least snapshotting.
I do not agree completely: there is value for the user if we prevent the use of a "simple" provisioner (like local-path), e.g., the volume is independent of the node, can be used on another node, etc.
Regarding testability: Alright, I did not know that this is such an important factor. 👍
> I do not agree completely: there is value for the user if we prevent the use of a "simple" provisioner (like local-path), e.g., the volume is independent of the node, can be used on another node, etc.
Definitely! Yet, it's kind of a niche use case.
In my experience, one would want to have either...
- some failure-safe volume for any kind of application. Maybe a trivial non-HA database installation for non-HA use cases. While availability may be of little concern, data durability often will be, making a storage backend with some hands-off redundancy very much favorable. This is the default on most clouds and will also work with most use cases involving applications that implement data high availability/durability themselves.
- OR some not necessarily failure-safe volume (may be local/node-bound/non-redundant, possibly fast) to run applications that take care of data durability and availability at the application level.
Offering a default storage class that is not redundant/failure-safe/replicated in any way, only non-local, would make e.g. "a lot of third-party software, such as Helm charts, [which] assume that a default storage class is configured" technically installable at first, but it would probably also break these charts' assumptions regarding data durability, undermining this standard's motivation of an "out-of-the-box working experience" (at least some time later, if/when a disk eventually fails).
That may be the outcome (yet I do not know of CSPs who are really constrained here), but no matter what, IMHO it should be spelled out more explicitly in the "Decision" section as well.
Signed-off-by: Katharina Trentau <[email protected]>
added the omitted requirements as recommendations under Decisions
Signed-off-by: Katharina Trentau <[email protected]>
Signed-off-by: Katharina Trentau <[email protected]>
Signed-off-by: Katharina Trentau <[email protected]>
…gnCloudStack/standards into 652-kaas-default-storageclass-v2
Signed-off-by: Katharina Trentau <[email protected]>
Signed-off-by: Katharina Trentau <[email protected]>
The omitted requirements for failure-safe storage have been added as recommendations, as there may be use cases that allow them to be ignored. In general, these recommendations can be adhered to and checked by selecting the right provisioner.
Resolves: #652