-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feedback on chapter 2 and chapter 3 #79
Comments
Additional comments:
|
Hi @ved-rivos and ARC team, thanks for the feedback. Provided the answers below.
A normal channel is a shared memory queue which is shared among multiple RPMI clients and normal channel transmits a RPMI message which has RPMI message header. A fast channel is a shared memory without any message format which transmits raw data usually few bytes (though its not a limitation). A service group if it requires fastchannel, then discovery of the fastchannel details, semantics, which services can send the data over fastchannel are defined by that particular service group. Yes, a service group supporting fastchannel can use both normal RPMI message via normal channel fastchannel. A service group has to define, if fastchannel is supported and implemented, then which services in that service group will only use fastchannel to send the data. For example, the CPPC service group defines the fastchannel support (Section: 4.5.1. CPPC Fast-channel) and also provide the mechanism to discover its support (Section: 4.5.7). It also mentions where AP should write the desired performance level based on the fastchannel availability.
Noted, will update.
Notification is a RPMI message with its own message format different from P2A Request message which is a request from Platform(PuC) to Application Processor. Request message is meant to carry a command which needs to be serviced by the service provider like PuC. Notification message is only to report events Acknowledgement for notification message is not necessary because events which are carried by notification message are only meant for logging or reporting purposes. AP can choose to subscribe to these notification messages for any service group and even if it has subscribed to them it's upto AP and implementation of that service group to decide what to do with the events.
Noted, will update
For implementation the indices for the message slots only matters which represents the actual queue. The text in this section and the notation We can add a non-normative text and elaborate this.
The cache maintenance will involve using cache flush/invalidate operation whenever the head and tail are updated.
RPMI spec defines the role of Head for dequeuing and Tail for enqueuing. It also states that ownership of head and tail is exclusive and only one entity is allowed to write to any of head or tail according to their role as producer or consumer. A queue has a message transmission direction associated which implicitly defines who will be the producer and who will be the consumer. There are special wrap indicators defined to highlight the wrapping of the queue. Checking empty and full can be done by only head and tail. Currently, there are no queue empty/full conditions explicitly defined in the specification because these are obvious based on above semantics. We certainly don’t mind defining these conditions.
Multiple application processors share the same RPMI queues and the messages in the queues are not bound to a particular application processor. Now it is up to the software running on the application processors about how to synchronize the access to the queues. Regarding interrupt to the application processors, there is only one interrupt defined for RPMI transport in P2A direction. This interrupt can be wired interrupt or MSI. If P2A interrupt is wired interrupt then it is managed through PLIC or APLIC which allows application processors to select the target processor. If P2A interrupt is MSI then the application processor can configure IMSIC coordinates (MSI address + data) of target processor using BASE service group.
The RPMI shared memory may or may not be cache-coherent for application processor and it may also be some on-chip RAM. This means the platform must set up PMA for the RPMI shared memory so that both PuC and application processors can coherently access it. The number of PMA regions supported for a RISC-V implementation is generally limited and non-contiguous RPMI shared memory required multiple PMA regions hence the note in the Section 2.3.3 suggests. We can explicitly mention that defining these attributes is the part of platform/implementation.
Noted, will update.
The shared memory queues (including the head and tail slots) are initialized by the implementation (aka PuC). From the AP perspective the shared memory is initialized and ready to use and details of queues as mentioned in Section 2.3.4 are provided to the AP. Upon kexec, the application processors can simply resume from the current head and tail values without any special coordination. Please note that kexec is a Linux feature and we can have any OS or firmware running on the application processors.
A normal message communication usually involves sending a message and then expecting a response hence the term “normal request”. Now in limited cases, we may not have any response so we choose the term “posted request” for such cases. There was a good consensus for the above terms among folks who have already reviewed the spec and a lot of software is already written using these terms. Unless absolutely required, we would like to continue with these terms.
Message header is fixed, message data length is variable which makes message length variable. But message length cannot be greater than the Slot Size. And a message cannot occupy multiple slots. The multipart message is different from this which i have explained in point 17.
Transport doorbell are optional. Spec mentions that even if doorbells are supported on either side they can ignore and do the polling. Thats why the Yes, this can be used for a particular message or for the entire lifecycle. For example if AP and PuC are capable for MSIs and AP has configured MSI details via defined service in Section 4.1.10 then this bit can always be enabled so that PuC always send MSI for each response. But AP can selectively disable it so that PuC in that case does not trigger a doorbell. RPMI spec can elaborate this more for this flag and we can update this.
The message token helps application processors track the origin of the request when it sees a response. It is generally recommended to have monotonically increasing token numbers and the token number can be initialized to any value and there is no constraint here. PuC does not check on the token number since this is only for application processors. For Kexec, the application processors must start from a new value of token so no special handling required.
A RPMI message is valid even if DATALEN field is 0 for example the service defined in Section 4.1.5 which does not have any request data and the Request is identified from ServiceGroupID and ServiceID. Implementations can perform the datalen checks. For example, the datalen field value received in an acknowledgement must not be greater than whats RPMI spec has defined for that service acknowledgement. More error checks can also be done by implementations. These checks are not defined by the RPMI specifications.
A multipart acknowledgement means that the data which is supposed to be sent in acknowledgement is greater and cannot be accommodated in a single message. In such cases there are extra fields defined in that service REMAINING and RETURNED to show that some data is remaining and requester must again In this case the multiple request messages made to get the complete data are considered as separate messages. Now since these are considered as different messages the STATUS field is also taken independent. AP in such cases must consider all STATUS together after all transactions are complete and make decisions accordingly. |
@dhaval-rivos Thanks for your feedback, will reply to your queries asap |
In Chapter 1. we have already expanded the scope of RPMI and who can implement it and also making it implicit that not every service group defined in the RPMI spec is meant for PuC.
This image is not meant to encompass everything defined in this spec and rather only present an overview.
Noted. Will update
I will remove that text from figure-2. But actually it just meant to highlight that a RPMI transport in SEE has SEE as owner which is implicit.
Noted, will update.
There are two scenarios possible - One is that ACK not getting to the application processor can happen either due to PuC not able to send ACK or issue with the transport. In such cases the type of corrective action will depend on the service being called and the RPMI clients. Similarly, a service can also return ACK with RPMI_ERR_TIMEOUT and corrective action is again depends on that service called on RPMI client side. For example a voltage change request may timeout when its implementation at PuC side is not able to service the request in time due to slow PSU or VRM. In such case the corrective action depends on RPMI client if it wants to retry again asap, or retry after sometime or any other measure.
Yes, this is little subjective and was also discussed with others and we decided to go with ALREADY as we found it more generic and appropriate for conditions which are already in progress or already happened. INPROGRESS was not chosen since it made more sense to use it only for in progress actions.
The specification only mandates to assign one MPXY channel to have one RPMI service group which means that two service groups cannot share one MPXY channel. But the implementation can have multiple MPXY channels with same service group mapped either in a single RPMI transport instance or across multiple RPMI transport instances in an implementation.
Noted, will update.
If there are no events supported then it can simply return RPMI_ERR_NOT_SUPPORTED. But every service group has to follow requirements 1 and 2 and 3 as stated in Chapter 4 starting and need to implement the ENABLE_NOTIFICATION service.
The discussion in the meeting on ReqFwd and MM concluded that the memory will already be allocated to avoid passing memory physical address every time which may require PMP mapping/unmapping and sanitization of pased memory addresses every time and its allocation is mandatory which led to adding these attributes. Passing 0 will reflect that its not present which will defy that. In case of EDK2 even if these shared memory details are made static then EDK2 non secure domain client can ignore these attributes instead of passing them as 0 from the MM service provider in secure domain.
Same as mentioned for question 10. |
There are at least two ways to implement the head and tail for a circular buffer:
The updated specification should be normative about which of the two schemes is intended. Based on the latest update it seems like the second scheme is the intent. If so that should be in normative text. |
The section 2.2.4 does not mention the details about the initialization. If the intent is for the PuC to initialize the queue head and tail then it should be mentioned. kexec is a linux concept but similar mechanisms such as vmphu may apply to other OSs. If an OS is expected to continue from wherever the head and tail happen to be then such an OS may receive notifications that it has not subscribed to or may get ACK for requests it has never made. This should be noted and based on the comments in this thread seems like they are expected to be benignly ignored. |
It is better to rename the flag as "doorbell interrupt is requested". The description of the flag states "inform the platform micro-controller to ring the doorbell". Is this flag not applicable to requests sent in P2A direction? The treatment of the flag[3] being set to 1 when doorbell interrupts are not enabled or not supported is undefined. Further, based on this description this flag seems to be not applicable in an ACK. |
The specification does not state anywhere that the TOKEN in a request must be echoed back in an ACK for the purposes of such correlation. Further describing this as a "sequence number" is misleading. A better description should be used as these are primarily message tags and not used to identify any form of sequence of messages. Please provide some explanation about the use of a TOKEN in a notification. Also when a channel is shared by multiple application processor harts then what method is used to determine which hart should process the notification message and whether events for multiple harts be packed into a single notification message. |
Noted, will make the conditions as normative text |
Noted, Can this be a non-normative text explaining this condition? |
This flag is applicable to Normal Requests from both AP or PuC. A sender if sets this bit without having doorbell interrupts enabled or without support will result in undefined behavior. We can mention this as a normative text. |
Specification does mention that TOKEN needs to be preserved, this is mentioned in Section 3.2.2 just below the RPMI Message Header table. Its true that these are not used as sequence number among a set of message sent over a period. Specification can clarify this in a non-normative text since this is not any requirement but just explanation what TOKEN represents.
Though TOKEN in a notification message does not serve a purpose but since RPMI has a common message header it has become part of notification message. I agree and specification can mention this wrt notification message. Noted, will update |
Yes. Non-normative text explaining this condition would suffice. |
The use of term "sequence number" is misleading. Since these are not sequence numbers, the specification should not use that term. For instance, one use of the token would be to fill it with the hart_id. |
Few further questions on the RPMI spec:
This implies there is some hidden state - "MSI is configured" - associated with the P2A doorbell. The BASE_SET_MSI message allows setting of this hidden state but there is no way to unset this state.
This poses an undue constraint on design of the platform. When the platform supports multiple chiplets or multiple sockets, the desire to group them all together may not be acceptable.
Besides the question 2, the syntax and semantics of the doorbell - DB_ADDR, DB_SETMASK, DB_PRESERVEMASK - are not specified in this specification. |
This configuration is required to configure the MSI details if the MSI based doorbell is preferred by the APs. Now, irrespective of MSI support and configuration of details using this service, the requests made by the AP to PuC should also have the flags[3] bit set so that PuC will use the MSI doorbell. If that flag is not set then irrespective of the configured MSI details the MSI will not be used. This is the rationale for current details. I agree that this service for MSI details configuration must also be complete which will require support to disable the details. I will update this service to mention that if this service is called again with MSI_ADDR = 0 and MSI_DATA = 0 then earlier MSI configured details will render unconfigured.
We decided to group the fastchannels according to the Read and Write done by the AP and PuC. The Performance Request fastchannels are Write by the AP and Read by the PuC. Similarly the Performance Feedback fastchannels are Write by the PuC and Read by the AP. Another possibility we discussed earlier to group both fastchannels per AP. Which means that performance request and performance feedback will be adjacent to each other. But, in this approach, depending on the system there was a likelihood of performance issues such as false sharing. Also we couldn't leave this layout open without any requirement and decided to have current mentioned approach. What is your suggestion?
Noted, will add details for these fields, what they represent and how they should be used by the implementations. |
Noted, will remove the use of "sequence number" for TOKEN and simply mention that its a integer tag/number |
The current approach is much more suitable and convenient for multiple chiplets scenario because a single fastchannel region minimizes the required PMA regions and PMP regions on AP side. In fact, the multi-chiplet scenario is already modeled in our QEMU based PoC. If a platform still wants separate shared memory for each chiplet then platform can easily create separate RPMI shared memory transports for each chiplet where the per-chiplet RPMI shared memory transport will only provide services local to that chiplet cluster. This is also modeled in the past and works fine with the current approach. Now grouping the Request channels together and Feedback channels together was considering a platform where the fastchannel region is not cache-coherent and requires explicit cache-maintainence. In such scenario, if Request channel and Feedback channel of HART are clubbed together then it becomes complicated to do cache-maintainence since both have different direction of data movement. |
I think there are certain assumptions here such as the fast channel being in memory that needs cache maintenance. The fast channel registers may be just memory mapped IO registers. The design already supports the discovery of the offset from the base of the region for each hart. That suffices and there need not be a mandate that they need to be grouped in any form. There may not be a PMA or PMP constraint depending on the platform or the constraints may be different for different platforms. The platforms can choose the most optimal layout. Since the offset of each register in the channel per hart can be discovered using GET_FAST_CHANNEL_OFFSET, the consumers of RPMI need not care about the order in which they are grouped. |
Why would address value of 0 and data value of 0 not be a valid MSI address/data pair? MSI addresses are required to be naturally aligned to 4 bytes. The bits 1 of the MSI low address may be defined as reserved/ignored and bit 0 as an enable. |
I agree. From RPMI spec perspective, a single fastchannel region and the individual harts having offsets within this region should be sufficient. |
Yes, I agree, having 0 from spec perspective which can be a valid address is not right. |
Yes, the region could describe all of addressable memory. Effectively the region is providing a some base for the offsets. |
In section 4.6, a description of what the VOLTAGE_MIN_SEL/VOLTAGE_MAX_SEL mean would be helpful. Its not clear in the specification how the multi-linear format should be interpreted. |
Noted, will add more details |
Usually clocks and voltages are related i.e.a voltage domain would expect the voltage be appropriate for the current selected clock as part of dynamic voltage/frequency scaling. Is that comprehended in some form? How are the voltage domain and clock IDs expected to be correlated? |
Section 4.8.1 - the power state values 15:0 seem to encode CONTEXT state in them - only two states are defined: ON with context preserved and OFF with context is lost. Is that the intent? Or was the encoding meant to just state OFF with the bit 16 providing an indication whether context is lost or preserved? |
No, the voltage and clock domains are considered separate from RPMI perspective even though there are dependencies among them in implementation and these are left for the platform/implementation and the drivers. Here the voltage and clocks services are mainly meant for the system clocks or system regulators apart from the CPU. For example, UART, Accelerator, etc. For DVFS like you asked, we have defined separate service groups CPPC (ACPI CPPC) and PERFORMANCE which are meant for CPU and Device performance scaling. These groups works on a defined performance scale(semantics are separate for CPPC and PERFORMANCE) and platform/implementation is responsible for translating the performance scale into the hardware operating points which may not be just voltage and frequency but will have more hardware control knobs. |
You could use data value of 0 as implying the MSI is not valid as the interrupt ID of 0 is not a valid interrupt ID. |
We should not assume anything about MSI controller in a platform. Its better to have explicit mechanism to enable/disable MSI in Base service group. |
Agree. |
I also inclined towards having explicit service for enable/disable the MSI. Configure MSI with addr and data will complement that |
Yes, this is intentional. For the defined states ON and OFF, when in the ON state, we expect CONTEXT to be preserved. However, in the OFF state, CONTEXT is lost. For vendor-specific states, vendors can define their own CONTEXT requirements. |
Its not clear why this overloading is required. The protocol correctly defines a power state and context field in the interface. There does not seem to be a need to overload the context value using a power state value. If the PuC does not support particular combination of power state and context - e.g., off without losing context - then it can deny that and this is also quite well specified in the protocol. What motivates creating a power state encoding that overrides context? |
A few comments on section 4.9:
|
These are only typical use cases. Will remove the context overloading for the ON/OFF states, leaving it to the PuC firmware to reject unsupported cases. |
|
Update based on feedback from ARC review. riscv-non-isa#79 Signed-off-by: Ley Foon Tan <[email protected]>
Section 4.4.9 - explain relation between EXIT_LATENCY and WAKEUP_LATENCY |
Hi @ved-rivos On the above question, the events are service groups specific and multiple events can be packed in a single notification message. Upon the arrival of a notification message, its really up to the implementation how it wants to consume the events in that message and its upto the implementation if it wants to have specific AP or any AP to consume the events. Are you expecting that spec should have a note about this? |
Noted, will update |
- Fix rate_limit to transition_latency - Add more description for PERF_SET_LIMIT riscv-non-isa#79 (comment) Signed-off-by: Ley Foon Tan <[email protected]>
Hi @ved-rivos, On the above question, there are no hidden states and all the states including the intermediate states |
- Fix rate_limit to transition_latency - Add more description for PERF_SET_LIMIT riscv-non-isa#79 (comment) Signed-off-by: Ley Foon Tan <[email protected]>
- Fix rate_limit to transition_latency - Add more descriptions for PERF_SET_LIMIT riscv-non-isa#79 (comment) Signed-off-by: Ley Foon Tan <[email protected]>
- Fix rate_limit to transition_latency - Add more descriptions for PERF_SET_LIMIT riscv-non-isa#79 (comment) Signed-off-by: Ley Foon Tan <[email protected]>
For instance, the HSM_START_STOP specification states: This implies that there is a piece of state in the microcontroller that says "detect next WFI invocation and
If the intention is that services such as SYSSYSUP or HART_STATE_MANAGEMENT are only supported over M-mode |
Hi @ved-rivos Please check the "Table 7. RPMI Service Groups" in the Chapter 4 Service groups. This list mentions about the priv modes too. |
Do fast and normal channels co-exist between same pair of producers
and consumers? If so what is the criteria for selecting between the two.
being avoided.
request? Do notifications not require an Ack?
effective size of the buffer is M-2, there is already a need to special case
the wrap decision.
head and tail slots.
wrap indicator or is the convention that tail = head-1 indicates a full queue?
number of application processors indicates that this queue is expected to be
concurrently operated on by multiple application processors? If so then in the
P2A direction: how is the de-multiplixing of requests/notifications/acks
performed and which application processor fields the interrupts?
entries. Each location in memory has a PMA. Please uplevel this note to
explain the intent of the note and call out what type of optimization
are intended.
and the queue memory. What coordination mechanisms are used to re-init the
queue (for instance on a kexec).
occupy multiple slots?
the producer must explicitly ring a doorbell. Is "enabled/disabled" meant to indicate
that doorbells must not be used for a queue or does it indicate that they are not
used for this specific message?
to be monotonic? What are the sequence numbers initialized to? Are there any error
checks for missing sequence numbers? How are sequence numbers synchronized
(for instance after a kexec).
a zero length message as the header does not have any operation code?
Can acknowledgements for two different messages be interleaved? Does the STATUS
field of each multipart message need to be identical and if not what are the rules for
the overall STATUS?
The text was updated successfully, but these errors were encountered: