From 581d37c84bd737ba3b3c070cf5e5ef775e8227ad Mon Sep 17 00:00:00 2001 From: Ved Shanbhogue Date: Thu, 2 Nov 2023 12:31:51 -0500 Subject: [PATCH 1/7] First page still mentions CMQRI instead of CBQRI --- qos_header.adoc | 2 +- readme.adoc | 2 +- 2 files changed, 2 insertions(+), 2 deletions(-) diff --git a/qos_header.adoc b/qos_header.adoc index a91673f..f7ec16e 100644 --- a/qos_header.adoc +++ b/qos_header.adoc @@ -37,7 +37,7 @@ endif::[] :xrefstyle: short = RISC-V Capacity and Bandwidth QoS Register Interface -RISC-V CMQRI Task Group +RISC-V CBQRI Task Group // Preamble [WARNING] diff --git a/readme.adoc b/readme.adoc index c3e3388..bc53697 100644 --- a/readme.adoc +++ b/readme.adoc @@ -1,6 +1,6 @@ = RISC-V Capacity and Bandwidth Controller QoS Register Interface -This document is capturing discussions at the CMQRI TG and attempts to document the baseline. +This document is capturing discussions at the CBQRI TG and attempts to document the baseline. This is not official specification and everything in this document may change. Quality of Service (QoS) is the minimal end-to-end performance that is guaranteed in advance From 0c1e7613f84edd05fffa8e8c9efd33aadcf285a3 Mon Sep 17 00:00:00 2001 From: Ved Shanbhogue Date: Thu, 2 Nov 2023 12:40:40 -0500 Subject: [PATCH 2/7] add reference to Smstateen --- qos.bib | 4 ++++ qos_identifiers.adoc | 4 +++- 2 files changed, 7 insertions(+), 1 deletion(-) diff --git a/qos.bib b/qos.bib index 30437bb..a10854b 100644 --- a/qos.bib +++ b/qos.bib @@ -11,6 +11,10 @@ @electronic{SSQOSID title = {RISC-V Quality-of-Service (QoS) Identifiers}, url = {https://github.com/riscv/riscv-ssqosid} } +@electronic{STATEEN, + title = {RISC-V State Enable Extension}, + url = {https://drive.google.com/file/d/1dhI6OzVbejQbfwyBTuwK9U4VUmW8ii4o/view} +} @INPROCEEDINGS{SSAMPLE, author={Thornock, N.C. and Flanagan, J.K.}, booktitle={2000 Winter Simulation Conference Proceedings (Cat. No.00CH37165)}, diff --git a/qos_identifiers.adoc b/qos_identifiers.adoc index 4be6e68..e152fb1 100644 --- a/qos_identifiers.adoc +++ b/qos_identifiers.adoc @@ -32,7 +32,9 @@ occupancy, memory bandwidth, etc.). The Ssqosid extension cite:[SSQOSID] introduces a read/write S/HS-mode register (`sqoscfg`) to configure QoS Identifiers to be used with requests made by the -hart to shared resources. +hart to shared resources. If extension Smstateen cite:[STATEEN] is implemented +then bit (TBA) of `mstateen0` controls access to `sqoscfg` from privilege modes +less than M. ==== Device initiated requests From 577308035d56e5d85fc1af100ca532d4735785db Mon Sep 17 00:00:00 2001 From: Ved Shanbhogue Date: Thu, 2 Nov 2023 12:46:37 -0500 Subject: [PATCH 3/7] Table 2: Offset N for cc_units is not specified. --- qos_capacity.adoc | 8 +++++--- 1 file changed, 5 insertions(+), 3 deletions(-) diff --git a/qos_capacity.adoc b/qos_capacity.adoc index cca264b..4972e16 100644 --- a/qos_capacity.adoc +++ b/qos_capacity.adoc @@ -27,7 +27,7 @@ way of the cache. [[CC_REG]] .Capacity-controller QoS register layout [width=100%] -[%header, cols="^3,10,^3, 18, 5"] +[%header, cols="^3,10,^4, 18, 5"] |=== |Offset|Name |Size |Description | Optional? |0 |`cc_capabilities` |8 |<> | No @@ -37,7 +37,7 @@ way of the cache. counter value>> | Yes |24 |`cc_alloc_ctl` |8 |<> | Yes -|32 |`cc_block_mask` |`M` * 8 |<> | Yes |N |`cc_cunits` |8 |<> | Yes @@ -46,7 +46,9 @@ way of the cache. The size and offset in <> are specified in bytes. The size of the `cc_block_mask` register is determined by the `NCBLKS` field -of the `cc_capabilities` register but is always a multiple of 8 bytes. +of the `cc_capabilities` register but is always a multiple of 8 bytes. The +formula for determination of `BMW` is defined in <>. The offset `N` is +determined as `32 + BMW/8`. The reset value is 0 for the following register fields. From c470bab83ea1bbaa3f995da7a91ea8e6ad4614e8 Mon Sep 17 00:00:00 2001 From: Ved Shanbhogue Date: Thu, 2 Nov 2023 13:55:06 -0500 Subject: [PATCH 4/7] =?UTF-8?q?Capacity=20controller:=20the=20notion=20of?= =?UTF-8?q?=20=E2=80=9Coccupation=E2=80=9D=20by=20an=20RCID=20needs=20clar?= =?UTF-8?q?ification?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit --- qos_capacity.adoc | 20 ++++++++++++++++++++ 1 file changed, 20 insertions(+) diff --git a/qos_capacity.adoc b/qos_capacity.adoc index 4972e16..ca8ab45 100644 --- a/qos_capacity.adoc +++ b/qos_capacity.adoc @@ -24,6 +24,26 @@ allocation by ways might define a _capacity block_ to be the cache blocks in one way of the cache. ==== +The capacity allocation affects the decision regarding which _capacity blocks_ +to use when a new _capacity unit_ is requested by a workload but usually does +not affect other operations of the controller. + +[NOTE] +==== +For example, when a request is made to a cache controller, the request involves +scanning the entire cache to determine if the requested data is present. If the +data is located, then the request is fulfilled using this data, even if the +cache block containing the data was initially allocated for a different +workload. The data continues to reside in the same cache block. Consequently, +the cache lookup function remains unaffected by the capacity allocation +constraints set for the workload initiating the request. Conversely, if the data +is not found, a new cache block must be allocated. This allocation is executed +using the _capacity blocks_ assigned to the workload that made the request. +Hence, a workload may only trigger evictions within _capacity blocks_ designated +to it but can access shared data in _capacity blocks_ allocated to other +workloads. +==== + [[CC_REG]] .Capacity-controller QoS register layout [width=100%] From 9e35cc08aeac62709890d088b447b0a34fc68756 Mon Sep 17 00:00:00 2001 From: Ved Shanbhogue Date: Thu, 2 Nov 2023 14:00:38 -0500 Subject: [PATCH 5/7] it should be mentioned that iommu_qosid must be configured before modifying ddtp.iommu_mode. --- qos_iommu.adoc | 4 +++- 1 file changed, 3 insertions(+), 1 deletion(-) diff --git a/qos_iommu.adoc b/qos_iommu.adoc index a0109b3..1891468 100644 --- a/qos_iommu.adoc +++ b/qos_iommu.adoc @@ -35,7 +35,9 @@ If the reset value for `ddtp.iommu_mode` field is `Bare`, then the At reset, it is required that the `RCID` field of `iommu_qosid` be set to 0 if the IOMMU is in `Bare` mode, as typically the resource controllers in the SoC default to a reset behavior of associating all capacity or bandwidth to the -`RCID` value of 0. +`RCID` value of 0. When the reset value of the `ddtp.iommu_mode` is not `Bare`, +the `iommu_qosid` register should be initialized by software prior to changing +the mode to allow DMA. ==== ==== IOMMU capabilities (`capabilities`) From 3886d212deb6bd49d1ed3d7ce7deeda51faa3ffe Mon Sep 17 00:00:00 2001 From: Ved Shanbhogue Date: Mon, 6 Nov 2023 06:48:16 -0800 Subject: [PATCH 6/7] add RCID prefix mode --- qos_bandwidth.adoc | 12 ++++++++++-- qos_capacity.adoc | 10 +++++++++- qos_hw_guidelines.adoc | 12 ++++++------ qos_identifiers.adoc | 21 ++++++++++++++++++--- 4 files changed, 43 insertions(+), 12 deletions(-) diff --git a/qos_bandwidth.adoc b/qos_bandwidth.adoc index e6c1068..db037fe 100644 --- a/qos_bandwidth.adoc +++ b/qos_bandwidth.adoc @@ -49,10 +49,12 @@ bandwidth-controller capabilities. {reg: [ {bits: 8, name: 'VER'}, {bits: 16, name: 'NBWBLKS'}, - {bits: 8, name: 'WPRI'}, + {bits: 1, name: 'RPFX'}, + {bits: 4, name: 'P'}, + {bits: 3, name: 'WPRI'}, {bits: 16, name: 'MRBWB'}, {bits: 16, name: 'WPRI'}, -], config:{lanes: 2, hspace:1024}} +], config:{lanes: 4, hspace:1024}} .... The `VER` field holds the version of the specification implemented by the @@ -83,6 +85,12 @@ allowing all available bandwidth blocks to be reserved for allocation, the bandwidth controller can handle such transient inaccuracies. ==== +If `RPFX` is 1, the controller uses `RCID` in the requests along with `P` least +significant bits of the `MCID` carried in the request to compute an effective +`MCID` (<>) to identify the monitoring counter. Legal values of `P` range +from 0 to 12. If `RPFX` is 0, `P` is set to 0, and the effective `MCID` is the +same as the `MCID` in the request. + [[BC_MCTL]] === Bandwidth usage monitoring control (`bc_mon_ctl`) diff --git a/qos_capacity.adoc b/qos_capacity.adoc index ca8ab45..547dc1f 100644 --- a/qos_capacity.adoc +++ b/qos_capacity.adoc @@ -99,7 +99,9 @@ capacity-controller capabilities. {bits: 16, name: 'NCBLKS'}, {bits: 1, name: 'FRCID'}, {bits: 1, name: 'CUNITS'}, - {bits: 38, name: 'WPRI'}, + {bits: 1, name: 'RPFX'}, + {bits: 4, name: 'P'}, + {bits: 33, name: 'WPRI'}, ], config:{lanes: 4, hspace:1024}} .... @@ -127,6 +129,12 @@ units_ that can be occupied by an `RCID` in _capacity blocks_ allocated to it. If `FRCID` is 1, the controller supports an operation to flush and deallocate the _capacity blocks_ occupied by an `RCID`. +If `RPFX` is 1, the controller uses `RCID` in the requests along with `P` least +significant bits of the `MCID` carried in the request to compute an effective +`MCID` (<>) to identify the monitoring counter. Legal values of `P` range +from 0 to 12. If `RPFX` is 0, `P` is set to 0, and the effective `MCID` is the +same as the `MCID` in the request. + [[CC_MCTL]] === Capacity usage monitoring control (`cc_mon_ctl`) diff --git a/qos_hw_guidelines.adoc b/qos_hw_guidelines.adoc index 8b0d23c..0894657 100644 --- a/qos_hw_guidelines.adoc +++ b/qos_hw_guidelines.adoc @@ -21,15 +21,15 @@ large pool of `MCID` speeds up this analysis. [NOTE] ==== -To achieve maximum flexibility in the allocation of QoS IDs to workloads, it is +To maximize flexibility in the allocation of QoS IDs to workloads, it is recommended that all resource controllers in the system support an identical -number of `RCID` and `MCID`. Requests typically need to be processed by multiple -controllers, such as caches, fabrics, and memory controllers. In such cases, if -not all controllers accessed by a request support an identical number of IDs, -software will be constrained, using only the IDs supported by all controllers. +number of `RCID` and `MCID`, as well as a uniform mode of operation — either +direct or RCID-prefixed — for determining the effective `MCID`. Uniformity +ensures that software is not constrained by the lowest common denominator of ID +support when requests are processed by multiple controllers, such as caches, +fabrics, and memory controllers. ==== - === Sizing monitoring counters Typically software samples the monitoring counters periodically to monitor diff --git a/qos_identifiers.adoc b/qos_identifiers.adoc index e152fb1..a685b0c 100644 --- a/qos_identifiers.adoc +++ b/qos_identifiers.adoc @@ -20,13 +20,28 @@ controllers that control accesses to such shared resources. Guidelines for sizing the QoS IDs and need for differentiated IDs for monitoring is discussed in <>. +[[EMCID]] === Associating `RCID` and `MCID` with requests The `RCID` in the request is used by the resource controllers to determine the resource allocations (e.g., cache occupancy limits, memory bandwidth limits, -etc.) to enforce. The `MCID` in the request is used by the resource controllers -to identify the ID of a counter to monitor resource usage (e.g., cache -occupancy, memory bandwidth, etc.). +etc.) to enforce. + +The `MCID` in the request is used by the resource controllers to identify the ID +of a counter to monitor resource usage (e.g., cache occupancy, memory bandwidth, +etc.). Two modes of operation are supported by CBQRI. In the direct mode, the +`MCID` carried with the request is directly used by the controller to identify +the counter and is the effective `MCID`. In the RCID-prefixed mode, the +controller identifies the counter for monitoring using an effective `MCID` +computed as: + +---- + Effective MCID = (RCID << P) | (MCID & ((1 << P) - 1)) +---- + +Legal values of `P` range from 0 to 12 and are enumerated in the capability +register of the controller. Software should use the effective `MCID` as the +`MCID` operand to the controller for operations on the monitoring counters. ==== RISC-V hart initiated requests (Ssqosid) From 4586859b1aa18a703263a4d17f4c62721c046e7c Mon Sep 17 00:00:00 2001 From: Ved Shanbhogue Date: Sat, 11 Nov 2023 15:21:29 -0600 Subject: [PATCH 7/7] add note about rcid and mcid width supported by IOMMU --- qos_iommu.adoc | 14 +++++++++----- 1 file changed, 9 insertions(+), 5 deletions(-) diff --git a/qos_iommu.adoc b/qos_iommu.adoc index 1891468..26e5574 100644 --- a/qos_iommu.adoc +++ b/qos_iommu.adoc @@ -1,13 +1,17 @@ [[QOS_IOMMU]] == IOMMU extension for QoS ID -Monitoring or allocation of resources accessed by the IOMMU and devices governed -by the IOMMU requires a way to associate QoS IDs with such requests. This -section specifies a RISC-V IOMMU cite:[IOMMU] extension to: +A method to associate QoS IDs with requests to access resources by the IOMMU, as +well as with devices governed by it, is required for effective monitoring and +allocation. This section specifies a RISC-V IOMMU cite:[IOMMU] extension to: * Configure and associate QoS IDs for device-originated requests. * Configure and associate QoS IDs for IOMMU-originated requests. +The size (or width) of `RCID` and `MCID`, as fields in registers or in data +structures, supported by the IOMMU must be at least as large as that supported +by any RISC-V application processor hart in the system. + === IOMMU registers The specified memory-mapped register layout defines a new IOMMU register named @@ -109,9 +113,9 @@ The `iommu_qosid` register fields are defined as follows: [%header, cols="^1,2,^1,5"] |=== |Bits |Field |Attribute | Description -|11:0 |`RCID` |WARL | RCID for IOMMU initiated requests. +|11:0 |`RCID` |WARL | `RCID` for IOMMU-initiated requests. |15:12 |reserved |WPRI | Reserved for standard use. -|27:16 |`MCID` |WARL | MCID for IOMMU initiated requests. +|27:16 |`MCID` |WARL | `MCID` for IOMMU-initiated requests. |31:28 |reserved |WPRI | Reserved for standard use. |===