The Casper Event Sidecar is an application that runs in tandem with the node process. It's main purpose is to:
- offload the node from broadcasting SSE events to multiple clients
- provide client features that aren't part of the nodes' functionality, nor should they be
While the primary use case for the Sidecar application is running alongside the node on the same machine, it can be run remotely if necessary.
Casper Sidecar has three main functionalities:
- Providing a SSE server with a firehose
/events
endpoint that streams all events from the connected nodes. Sidecar also stores observed events in storage. - Providing a REST API server that allows clients to query events in storage.
- Be a JSON RPC bridge between end users and a Casper node's binary RPC port.
The system has the following components and external dependencies:
graph LR;
subgraph CASPER-SIDECAR
SSE_SERVER["SSE server"]
RPC_API_SERVER["RPC API server (json)"]
REST_API["Rest API server"]
ADMIN_API["Admin API server"]
end
CONFIG{{"Config file (toml)"}}
CONFIG --> CASPER-SIDECAR
STORAGE[(Storage)]
NODE_SSE(("Casper Node SSE port"))
NODE_BINARY(("Casper Node binary port"))
RPC_API_SERVER --> NODE_BINARY
SSE_SERVER --> NODE_SSE
SSE_SERVER --> STORAGE
STORAGE --> REST_API
Diving into the SSE Server, we see the following components:
graph TD;
CLIENT{Client}
CLIENT --> SSE_SERVER_API
STORAGE[("Storage")]
CONFIG{{"Config file (toml)"}}
MAIN --1.reads--> CONFIG
NODE_SSE{Node SSE port}
SSE_LISTENER --2--> STORAGE
NODE_SSE --1--> SSE_LISTENER
subgraph "Casper sidecar"
MAIN[main.rs]
MAIN --2.spawns---> SSE-SERVER
subgraph SSE-SERVER
SSE_SERVER_API["SSE API"]
RING_BUFFER["Events buffer"]
SSE_SERVER_API --> RING_BUFFER
SSE_LISTENER --3--> RING_BUFFER
subgraph "For connection in connections"
SSE_LISTENER["SSE Listener"]
end
end
end
Given the flow above, the SSE Listener processes events in this order:
- Fetch an event from the node's SSE port
- Store the event
- Publish the event to the SSE API
Casper nodes offer an event stream API that returns Server-Sent Events (SSEs) with JSON-encoded data. The Sidecar reads the event stream of all connected nodes, acting as a passthrough and replicating the SSE interface of the connected nodes. The Sidecar can:
- republish the current events from the node to clients listening to Sidecar's SSE API
- publish a configurable number of previous events to clients connecting to the Sidecar's SSE API with
?start_from=
query (similar to the node's SSE API) - store the events in external storage for clients to query them via the Sidecar's REST API Enabling and configuring the SSE Server of the Sidecar is optional.
graph LR;
CLIENT{Client}
CLIENT --> REST_API
STORAGE[("Storage")]
REST_API --> STORAGE
CONFIG{{"Config file (toml)"}}
MAIN --1.reads--> CONFIG
subgraph "Casper sidecar"
MAIN[main.rs]
MAIN --2.spawns--> REST_API
REST_API["REST API"]
end
The Sidecar offers an optional REST API that allows clients to query the events stored in external storage. Node operators can discover the specific endpoints of the REST API using [OpenAPI] (#openapi-specification) and [Swagger] (#swagger-documentation). Also, the usage instructions provide more details.
graph LR;
CLIENT{Client}
CLIENT --> ADMIN_API
CONFIG{{Config file}}
MAIN --1.reads--> CONFIG
subgraph "Casper sidecar"
MAIN[main.rs]
MAIN --2.spawns--> ADMIN_API
ADMIN_API["ADMIN API"]
end
The Sidecar offers an administrative API to allow an operator to check its current status. The Sidecar operator has the option to enable and configure this API. Please see the admin server configuration for details.
graph LR;
CLIENT{Client}
CLIENT --> RPC_API
CONFIG{{Config file}}
MAIN --1.reads--> CONFIG
CASPER_NODE(("Casper Node binary port"))
RPC_API --forwards request--> CASPER_NODE
subgraph "Casper sidecar"
MAIN[main.rs]
MAIN --2.spawns--> RPC_API
RPC_API["RPC JSON API"]
end
The Sidecar offers an optional RPC JSON API module that can be enabled and configured. It is a JSON bridge between end users and a Casper node's binary port. The RPC API server forwards requests to the Casper node's binary port. For more details on how the RPC JSON API works, see the RPC Sidecar README.
Here is an example configuration of the RPC API server:
[rpc_server.main_server]
enable_server = true
address = '0.0.0.0:7777'
qps_limit = 100
max_body_bytes = 2_621_440
cors_origin = ''
[rpc_server.node_client]
address = '127.0.0.1:28101'
max_message_size_bytes = 4_194_304
request_limit = 3
request_buffer_size = 16
[rpc_server.speculative_exec_server]
enable_server = true
address = '0.0.0.0:7778'
qps_limit = 1
max_body_bytes = 2_621_440
cors_origin = ''
[rpc_server.node_client.exponential_backoff]
initial_delay_ms = 1000
max_delay_ms = 32_000
coefficient = 2
max_attempts = 30
-
main_server.enable_server
- The RPC API server will be enabled if set to true. -
main_server.address
- Address under which the main RPC API server will be available. -
main_server.qps_limit
- The maximum number of requests per second. -
main_server.max_body_bytes
- Maximum body size of request to API in bytes. -
main_server.cors_origin
- Configures the CORS origin. -
speculative_exec_server.enable_server
- If set to true, the speculative RPC API server will be enabled. -
speculative_exec_server.address
- Address under which the speculative RPC API server will be available. -
speculative_exec_server.qps_limit
- The maximum number of requests per second. -
speculative_exec_server.max_body_bytes
- Maximum body size of request to API in bytes. -
speculative_exec_server.cors_origin
- Configures the CORS origin. -
node_client.address
- Address of the Casper Node binary port -
node_client.max_message_size_bytes
- Maximum binary port message size in bytes. -
node_client.request_limit
- Maximum number of in-flight requests. -
node_client.request_buffer_size
- Number of node requests that can be buffered. -
node_client.exponential_backoff.initial_delay_ms
- Timeout after the first broken connection (backoff) in milliseconds. -
node_client.exponential_backoff.max_delay_ms
- Maximum timeout after a broken connection in milliseconds. -
node_client.exponential_backoff.coefficient
- Coefficient for the exponential backoff. The next timeout is calculated as min(current_timeout * coefficient
,max_delay_ms
). -
node_client.exponential_backoff.max_attempts
- Maximum number of times to try to reconnect to the binary port of the node.
- CMake 3.1.4 or greater
- Rust
- pkg-config
- gcc
- g++
The SSE Sidecar service must be configured using a .toml
file specified at runtime.
This repository contains several sample configuration files that can be used as examples and adjusted according to your scenario:
- EXAMPLE_NCTL_CONFIG.toml - Configuration for connecting to nodes on a local NCTL network. This configuration is used in the unit and integration tests found in this repository
- EXAMPLE_NCTL_POSTGRES_CONFIG.toml - Configuration for using the PostgreSQL database and nodes on a local NCTL network
- EXAMPLE_NODE_CONFIG.toml - Configuration for connecting to live nodes on a Casper network and setting up an admin server
Once you create the configuration file and are ready to run the Sidecar service, you must provide the configuration as an argument using the -- --path-to-config
option as described here.
The Casper sidecar SSE server is used to connect to casper nodes, listen to events from them, store them locally and re-broadcast them to clients. The configuration for the SSE server itself is as follows:
[sse_server]
enable_server = true
emulate_legacy_sse_apis = ["V1"]
[[sse_server.connections]]
<Described later in the document>
[sse_server.event_stream_server]
<Described later in the document>
sse_server.enable_server
- If set to true, the SSE server will be enabled.sse_server.emulate_legacy_sse_apis
- A list of legacy casper node SSE APIs to emulate. The Sidecar will expose sse endpoints that are compatible with specified versions. Please bear in mind that this feature is an emulation and should be used only for transition periods. In most case scenarios having a 1 to 1 mapping of new messages into old formats is impossible, so this can be a process that looses some data and/or doesn't emit all messages that come out of the casper node. The details of the emulation are described in section Event Stream Server SSE legacy emulations module.
The Casper Sidecar's SSE component can connect to Casper nodes' SSE endpoints with versions greater or equal to 2.0.0
.
The node_connections
option configures the node (or multiple nodes) to which the Sidecar will connect and the parameters under which it will operate with that node. Connecting to multiple nodes requires multiple [[sse_server.connections]]
sections.
[[sse_server.connections]]
ip_address = "127.0.0.1"
sse_port = 18101
rest_port = 14101
max_attempts = 10
delay_between_retries_in_seconds = 5
allow_partial_connection = false
enable_logging = true
connection_timeout_in_seconds = 3
no_message_timeout_in_seconds = 60
sleep_between_keep_alive_checks_in_seconds = 30
[[sse_server.connections]]
ip_address = "127.0.0.1"
sse_port = 18102
rest_port = 14102
max_attempts = 10
delay_between_retries_in_seconds = 5
allow_partial_connection = false
enable_logging = false
connection_timeout_in_seconds = 3
no_message_timeout_in_seconds = 60
sleep_between_keep_alive_checks_in_seconds = 30
[[sse_server.connections]]
ip_address = "127.0.0.1"
sse_port = 18103
rest_port = 14103
max_attempts = 10
delay_between_retries_in_seconds = 5
allow_partial_connection = false
enable_logging = false
connection_timeout_in_seconds = 3
no_message_timeout_in_seconds = 60
sleep_between_keep_alive_checks_in_seconds = 30
ip_address
- The IP address of the node to monitor.sse_port
- The node's event stream (SSE) port. This example configuration uses port9999
.rest_port
- The node's REST endpoint for status and metrics. This example configuration uses port8888
.max_attempts
- The maximum number of attempts the Sidecar will make to connect to the node. If set to0
, the Sidecar will not attempt to connect.delay_between_retries_in_seconds
- The delay between attempts to connect to the node.allow_partial_connection
- Determining whether the Sidecar will allow a partial connection to this node.enable_logging
- This enables the logging of events from the node in question.connection_timeout_in_seconds
- Number of seconds before the connection request times out. Parameter is optional, defaults to 5no_message_timeout_in_seconds
- Number of seconds after which the connection will be restarted if no bytes were received. Parameter is optional, defaults to 120sleep_between_keep_alive_checks_in_seconds
- Optional parameter specifying the time intervals (in seconds) for checking if the connection is still alive. Defaults to 60
Please see Legacy sse emulation file
This directory stores the SSE cache and an SQLite database if the Sidecar is configured to use SQLite.
[storage]
storage_path = "./target/storage"
The Sidecar can connect to different types of databases. The current options are SQLite
or PostgreSQL
. The following sections show how to configure the database connection for one of these DBs. Note that the Sidecar can only connect to one DB at a time.
This section includes configurations for the SQLite database.
[storage.sqlite_config]
file_name = "sqlite_database.db3"
max_connections_in_pool = 100
# https://www.sqlite.org/compile.html#default_wal_autocheckpoint
wal_autocheckpointing_interval = 1000
storage.sqlite_config.file_name
- The database file path.storage.sqlite_config.max_connections_in_pool
- The maximum number of connections to the database (should generally be left as is).storage.sqlite_config.wal_autocheckpointing_interval
- This controls how often the system commits pages to the database. The value determines the maximum number of pages before forcing a commit. More information can be found here.
The properties listed below are elements of the PostgreSQL database connection that can be configured for the Sidecar.
storage.postgresql_config.database_name
- Name of the database.storage.postgresql_config.host
- URL to PostgreSQL instance.storage.postgresql_config.database_username
- Username.storage.postgresql_config.database_password
- Database password.storage.postgresql_config.max_connections_in_pool
- The maximum number of connections to the database.storage.postgresql_config.port
- The port for the database connection.
To run the Sidecar with PostgreSQL, you can set the following database environment variables to control how the Sidecar connects to the database. This is the suggested method to set the connection information for the PostgreSQL database.
SIDECAR_POSTGRES_USERNAME="your username"
SIDECAR_POSTGRES_PASSWORD="your password"
SIDECAR_POSTGRES_DATABASE_NAME="your database name"
SIDECAR_POSTGRES_HOST="your host"
SIDECAR_POSTGRES_MAX_CONNECTIONS="max connections"
SIDECAR_POSTGRES_PORT="port"
However, DB connectivity can also be configured using the Sidecar configuration file.
If the DB environment variables and the Sidecar's configuration file have the same variable set, the DB environment variables will take precedence.
It is possible to completely omit the PostgreSQL configuration from the Sidecar's configuration file. In this case, the Sidecar will attempt to connect to the PostgreSQL using the database environment variables or use some default values for non-critical variables.
[storage.postgresql_config]
database_name = "event_sidecar"
host = "localhost"
database_password = "p@$$w0rd"
database_username = "postgres"
max_connections_in_pool = 30
This information determines outbound connection criteria for the Sidecar's rest_server
.
[rest_api_server]
enable_server = true
port = 18888
max_concurrent_requests = 50
max_requests_per_second = 50
request_timeout_in_seconds = 10
enable_server
- If set to true, the RPC API server will be enabled.port
- The port for accessing the sidecar'srest_server
.18888
is the default, but operators are free to choose their own port as needed.max_concurrent_requests
- The maximum total number of simultaneous requests that can be made to the REST server.max_requests_per_second
- The maximum total number of requests that can be made per second.request_timeout_in_seconds
- The total time before a request times out.
[sse_server.event_stream_server]
port = 19999
max_concurrent_subscribers = 100
event_stream_buffer_length = 5000
The sse_server.event_stream_server
section specifies a port for the Sidecar's event stream.
Additionally, there are the following two options:
event_stream_server.port
- Port under which the SSE server is published.event_stream_server.max_concurrent_subscribers
- The maximum number of subscribers that can monitor the Sidecar's event stream.event_stream_server.event_stream_buffer_length
- The number of events that the stream will hold in its buffer for reference when a subscriber reconnects.
This optional section configures the Sidecar's administrative server. If this section is not specified, the Sidecar will not start an admin server.
[admin_api_server]
enable_server = true
port = 18887
max_concurrent_requests = 1
max_requests_per_second = 1
enable_server
- If set to true, the RPC API server will be enabled.port
- The port for accessing the Sidecar's admin server.max_concurrent_requests
- The maximum total number of simultaneous requests that can be sent to the admin server.max_requests_per_second
- The maximum total number of requests that can be sent per second to the admin server.
Access the admin server at http://localhost:18887/metrics/
.
Once the Sidecar is running, access the Swagger documentation at http://localhost:18888/swagger-ui/
. You need to replace localhost
with the IP address of the machine running the Sidecar application if you are running the Sidecar remotely. The Swagger documentation will allow you to test the REST API.
An OpenAPI schema is available at http://localhost:18888/api-doc.json/
. You need to replace localhost
with the IP address of the machine running the Sidecar application if you are running the Sidecar remotely.
You can run the unit and integration tests included in this repository with the following command:
cargo test
You can also run the performance tests using the following command:
cargo test -- --include-ignored
The EXAMPLE_NCTL_CONFIG.toml file contains the configurations used for these tests.
After creating the configuration file, run the Sidecar using Cargo and point to the configuration file using the --path-to-config
option, as shown below. The command needs to run with root
privileges.
sudo cargo run -- --path-to-config ./resources/example_configs/EXAMPLE_NODE_CONFIG.toml
The Sidecar application leverages tracing, which can be controlled by setting the RUST_LOG
environment variable.
The following command will run the sidecar application with the INFO
log level.
RUST_LOG=info cargo run -p casper-sidecar -- --path-to-config ./resources/example_configs/EXAMPLE_NCTL_CONFIG.toml
The log levels, listed in order of increasing verbosity, are:
ERROR
WARN
INFO
DEBUG
TRACE
Further details about log levels can be found here.
The Sidecar application can be tested against live Casper nodes or a local NCTL network.
The configuration shown within this README will direct the Sidecar application to a locally hosted NCTL network if one is running. The Sidecar should function the same way it would with a live node, displaying events as they occur in the local NCTL network.
This section covers helpful tips when troubleshooting the Sidecar service. Replace the URL and ports provided in the examples as appropriate.
To check whether the Sidecar is running, run the following curl
command, which returns the newest stored block.
curl http://SIDECAR_URL:SIDECAR_REST_PORT/block
Each block should have a .block.header.timestamp
field. Even if there were no deploys, a block should be produced every 30-60 seconds. If the latest block falls behind, it means there is an issue with the Sidecar reading events from the node. Here is a helpful script provided jq
is installed:
curl http://SIDECAR_URL:SIDECAR_REST_PORT/block | jq '.block.header.timestamp'
Checking the node connection status requires the admin server to be enabled, as shown here. Use this curl
command and observe the output:
curl http://SIDECAR_URL:SIDECAR_ADMIN_PORT/metrics
Sample output:
# HELP node_statuses Current status of node to which sidecar is connected. Numbers mean: 0 - preparing; 1 - connecting; 2 - connected; 3 - reconnecting; -1 - connections_exhausted -> used up all connection attempts ; -2 - incompatible -> node is in an incompatible version
# TYPE node_statuses gauge
node_statuses{node="35.180.42.211:9999"} 2
node_statuses{node="69.197.42.27:9999"} 2
In the above node_statuses
, you can see which nodes are connecting, which are already connected, which are disconnected due to no more retries, etc. The number next to each node represents the connection status:
0
- The Sidecar is preparing to connect1
- The Sidecar is connecting to the node2
- The Sidecar is connected to this node3
- The Sidecar is reconnecting-1
- The Sidecar is not connected and has reached the maximum connection attempts-2
- The Sidecar is not connected due to an incompatible node version
To diagnose errors, look for error
logs and check the error_counts
on the metrics page, http://SIDECAR_URL:SIDECAR_ADMIN_PORT/metrics
, where most of the errors related to data flow will be stored:
# HELP error_counts Error counts
# TYPE error_counts counter
error_counts{category="connection_manager",description="fetching_from_stream_failed"} 6
To monitor the Sidecar's memory consumption, observe the metrics page, http://SIDECAR_URL:SIDECAR_ADMIN_PORT/metrics
. Search for process_resident_memory_bytes
:
# HELP process_resident_memory_bytes Resident memory size in bytes.
# TYPE process_resident_memory_bytes gauge
process_resident_memory_bytes 292110336
If memory consumption is high without an apparent reason, please inform the Sidecar team by creating an issue in GitHub.
Remember to check the event_stream_buffer_length
setting in the configuration because it dramatically impacts how much memory the Sidecar consumes. Also, some events, like step events, consume more memory.
Ensuring enough space in the database is essential for the Sidecar to consume events produced from the nodes' SSE streams over a more extended period. Each event is written to the database in a raw format for future processing. Running the Sidecar for an extended period (weeks or months) can result in storing multiple Gigabytes of data. If the database runs out of space, the Sidecar will lose events, as it cannot record them.
The easiest way to inspect the Sidecar’s REST API is with Swagger.
The Sidecar can be configured to limit concurrent requests (max_concurrent_requests
) and requests per second (max_requests_per_second
) for the REST and admin servers.
However, remember that those are application-level guards, meaning that the operating system already accepted the connection, which used up the operating system's resources. Limiting potential DDoS attacks requires consideration before the requests are directed to the Sidecar application.