
The Casper Sidecar

Summary of Purpose
System Components and Architecture
Configuring the Sidecar
Running and Testing the Sidecar
Swagger Documentation
OpenAPI Specification
Troubleshooting Tips

Summary of Purpose

The Casper Sidecar is an application running in tandem with the node process. It allows subscribers to monitor a node's event stream, query stored events, and query the node's JSON RPC API, thus receiving faster responses and reducing the load placed on the node. Its primary purpose is to:

Offload the node from broadcasting SSE events to multiple clients.
Provide client features that are not part of the node's functionality and should not be.

While the primary use case for the Sidecar application is running alongside the node on the same machine, it can be run remotely if necessary.

System Components and Architecture

The Casper Sidecar provides the following functionalities:

A server-sent events (SSE) server with an /events endpoint that streams all the events received from all connected nodes. The Sidecar also stores these events.
A REST API server that allows clients to query stored events.
A JSON RPC bridge between end users and a Casper node's binary port.
Legacy emulation for clients using older versions of the SSE API.

The Sidecar has the following components and external dependencies:

---
title: The Casper Sidecar Components
---
   graph LR;
   subgraph CASPER_SIDECAR
      SSE_SERVER["SSE server"]
      RPC_API_SERVER["RPC API server (JSON)"]
      REST_API["REST API server"]
      ADMIN_API["Admin API server"]
   end
   CONFIG{{"Config file (TOML)"}}
   CONFIG --> CASPER_SIDECAR
   STORAGE[(Storage)]
   NODE_SSE(("Casper node SSE port"))
   NODE_BINARY(("Casper node binary port"))
   RPC_API_SERVER --> NODE_BINARY
   SSE_SERVER --> NODE_SSE
   SSE_SERVER --> STORAGE
   STORAGE --> REST_API

The SSE server

The SSE Server has these components:

   graph TD;
   CLIENT{Client}
   CLIENT --> SSE_SERVER_API
   STORAGE[("Storage")]
   CONFIG{{"Config file (TOML)"}}
   MAIN --1.reads--> CONFIG
   NODE_SSE{Node SSE port}
   SSE_LISTENER --2--> STORAGE
   NODE_SSE --1--> SSE_LISTENER
   subgraph CASPER_SIDECAR
     MAIN[main.rs]
     MAIN --2.spawns---> SSE_SERVER
     subgraph SSE_SERVER
        SSE_SERVER_API["SSE API"]
        RING_BUFFER["Events buffer"]
        SSE_SERVER_API --> RING_BUFFER
        SSE_LISTENER --3--> RING_BUFFER
        subgraph "connection"
          SSE_LISTENER["SSE listener"]   
        end
     end
   end

The SSE Listener processes events in this order:

Fetch an event from the node's SSE port.
Store the event.
Publish the event to the SSE API.
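
The ordering above can be sketched as a small loop. This is only an illustration of the fetch, store, publish sequence; `run_listener`, `store`, and `publish` are hypothetical names and do not correspond to the Sidecar's actual Rust implementation.

```python
from typing import Callable, Iterable, List


def run_listener(
    events: Iterable[dict],
    store: Callable[[dict], None],
    publish: Callable[[dict], None],
) -> int:
    """Handle each event in the documented order and return the count."""
    handled = 0
    for event in events:   # 1. fetch an event from the (simulated) node stream
        store(event)       # 2. store the event
        publish(event)     # 3. publish the event to the SSE API
        handled += 1
    return handled


# Usage with in-memory stand-ins for storage and the SSE API.
stored: List[dict] = []
published: List[dict] = []
run_listener([{"id": 1}, {"id": 2}], stored.append, published.append)
```

Storing before publishing means that, in this sketch, a client never sees an event that has not already been handed to the storage callback.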

Casper nodes offer an event stream API that returns server-sent events (SSEs) with JSON-encoded data. The Sidecar reads the event stream of all connected nodes, acting as a passthrough and replicating the SSE interface of the connected nodes.

The Sidecar can:

Republish the current events from the node to clients listening to Sidecar's SSE API.
Publish a configurable number of previous events to clients connecting to the Sidecar's SSE API with ?start_from= query (similar to the node's SSE API).
Store the events in external storage for clients to query them via the Sidecar's REST API.

Enabling and configuring the SSE Server of the Sidecar is optional.
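
As a rough client-side illustration, the sketch below builds an /events URL with an optional ?start_from= query and parses the standard SSE wire format (`data:` and `id:` fields, with a blank line ending each event). The helper names are hypothetical. The parser is simplified: it resets the event ID after each event and ignores other SSE fields.

```python
from typing import Iterable, Iterator, Optional, Tuple
from urllib.parse import urlencode


def events_url(host: str, port: int, start_from: Optional[int] = None) -> str:
    """Build the Sidecar /events URL, optionally asking for earlier events."""
    base = f"http://{host}:{port}/events"
    if start_from is None:
        return base
    return base + "?" + urlencode({"start_from": start_from})


def parse_sse(lines: Iterable[str]) -> Iterator[Tuple[Optional[str], str]]:
    """Yield (event_id, data) pairs from an iterable of SSE text lines."""
    event_id: Optional[str] = None
    data = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line terminates the current event.
            if data:
                yield event_id, "\n".join(data)
            event_id, data = None, []
        elif line.startswith(":"):
            continue  # comment line, often used as a keep-alive
        elif line.startswith("data:"):
            value = line[5:]
            data.append(value[1:] if value.startswith(" ") else value)
        elif line.startswith("id:"):
            event_id = line[3:].strip()
```

The lines passed to `parse_sse` could come from any HTTP client that streams the response body line by line.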

The REST API server

The Sidecar offers an optional REST API that allows clients to query the events stored in external storage. You can discover the specific endpoints of the REST API using OpenAPI and Swagger. The usage instructions provide more details.

   graph LR;
   CLIENT{Client}
   CLIENT --> REST_API
   STORAGE[("Storage")]
   REST_API --> STORAGE
   CONFIG{{"Config file (TOML)"}}
   MAIN --1.reads--> CONFIG
   subgraph CASPER_SIDECAR
      MAIN[main.rs]
      MAIN --2.spawns--> REST_API
      REST_API["REST API"]
   end

The Admin API server

The Sidecar offers an administrative API to allow an operator to check its current status. The Sidecar operator has the option to enable and configure this API. Please see the admin server configuration for details.

   graph LR;
   CLIENT{Client}
   CLIENT --> ADMIN_API
   CONFIG{{Config file}}
   MAIN --1.reads--> CONFIG
   subgraph CASPER_SIDECAR
      MAIN[main.rs]
      MAIN --2.spawns--> ADMIN_API
      ADMIN_API["ADMIN API"]
   end

The RPC API server

The Sidecar also offers an RPC JSON API server that can be enabled and configured so that clients can interact with a Casper network. It is a JSON bridge between end users and a Casper node's binary port. The RPC API server forwards requests to the Casper node's binary port. For more details on how the RPC JSON API works, see the RPC Sidecar README.

   graph LR;
   CLIENT{Client}
   CLIENT --> RPC_API
   CONFIG{{Config file}}
   MAIN --1.reads--> CONFIG
   CASPER_NODE(("Casper Node binary port"))
   RPC_API --forwards request--> CASPER_NODE
   subgraph "Casper Sidecar"
      MAIN[main.rs]
      MAIN --2.spawns--> RPC_API
      RPC_API["RPC JSON API"]
   end
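
For illustration, a client request to the RPC API server is a JSON-RPC 2.0 message. The sketch below only builds the request body; `build_rpc_request` is a hypothetical helper, and the `rpc.discover` method name is used purely as an example. The URL path to post to is not stated here, so consult the RPC Sidecar README before sending it.

```python
import json
from typing import Any, Optional


def build_rpc_request(method: str, params: Optional[Any] = None, request_id: int = 1) -> str:
    """Serialize a JSON-RPC 2.0 request body for the given method."""
    body = {"jsonrpc": "2.0", "id": request_id, "method": method}
    if params is not None:
        body["params"] = params
    return json.dumps(body)


# Example body for a schema discovery call (method name is illustrative).
payload = build_rpc_request("rpc.discover")
```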

Configuring the Sidecar

The Sidecar service must be configured using a .toml file specified at runtime.

This repository contains several sample configuration files that can be used as examples and adjusted according to your scenario:

EXAMPLE_NCTL_CONFIG.toml - Configuration for connecting to nodes on a local NCTL network. This configuration is used in the unit and integration tests found in this repository.
EXAMPLE_NCTL_POSTGRES_CONFIG.toml - Configuration for using the PostgreSQL database and nodes on a local NCTL network.
EXAMPLE_NODE_CONFIG.toml - Configuration for connecting to live nodes on a Casper network.

Once you create the configuration file and are ready to run the Sidecar service, you must provide the configuration as an argument using the -- --path-to-config option, as described in Running the Sidecar.

RPC server setup

Here is an example configuration for the RPC API server:

[rpc_server.main_server]
enable_server = true
address = '0.0.0.0:7777'
qps_limit = 100
max_body_bytes = 2_621_440
cors_origin = ''

[rpc_server.node_client]
address = '0.0.0.0:28101'
max_message_size_bytes = 4_194_304
request_limit = 3
request_buffer_size = 16
message_timeout_secs = 30
client_access_timeout_secs = 2

[rpc_server.speculative_exec_server]
enable_server = true
address = '0.0.0.0:7778'
qps_limit = 1
max_body_bytes = 2_621_440
cors_origin = ''


[rpc_server.node_client.exponential_backoff]
initial_delay_ms = 1000
max_delay_ms = 32_000
coefficient = 2
max_attempts = 30

main_server.enable_server - The RPC API server will be enabled if set to true.
main_server.address - Address under which the main RPC API server will be available.
main_server.qps_limit - The maximum number of requests per second.
main_server.max_body_bytes - Maximum body size of request to API in bytes.
main_server.cors_origin - Configures the CORS origin.
speculative_exec_server.enable_server - If set to true, the speculative RPC API server will be enabled.
speculative_exec_server.address - Address under which the speculative RPC API server will be available.
speculative_exec_server.qps_limit - The maximum number of requests per second.
speculative_exec_server.max_body_bytes - Maximum body size of request to API in bytes.
speculative_exec_server.cors_origin - Configures the CORS origin.
node_client.address - Address of the Casper Node binary port.
node_client.max_message_size_bytes - Maximum binary port message size in bytes.
node_client.request_limit - Maximum number of in-flight requests.
node_client.request_buffer_size - Number of node requests that can be buffered.
node_client.message_timeout_secs - Timeout for the message, in seconds.
node_client.client_access_timeout_secs - Timeout for the client connection, in seconds.
node_client.exponential_backoff.initial_delay_ms - Timeout after the first broken connection (backoff) in milliseconds.
node_client.exponential_backoff.max_delay_ms - Maximum timeout after a broken connection in milliseconds.
node_client.exponential_backoff.coefficient - Coefficient for the exponential backoff. The next timeout is calculated as min(current_timeout * coefficient, max_delay_ms).
node_client.exponential_backoff.max_attempts - Maximum number of times to try to reconnect to the binary port of the node.
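
The backoff rule above, min(current_timeout * coefficient, max_delay_ms), can be worked through with a short sketch. `backoff_delays` is a hypothetical helper that lists the delay before each reconnection attempt. Whether the Sidecar waits before the very first attempt is an assumption made here for simplicity.

```python
from typing import List


def backoff_delays(
    initial_delay_ms: int,
    max_delay_ms: int,
    coefficient: int,
    max_attempts: int,
) -> List[int]:
    """Return the delay (in ms) used before each reconnection attempt."""
    delays = []
    delay = initial_delay_ms
    for _ in range(max_attempts):
        delays.append(delay)
        # Next timeout, capped at max_delay_ms as described above.
        delay = min(delay * coefficient, max_delay_ms)
    return delays


# Values from the example configuration above.
delays = backoff_delays(1000, 32_000, 2, 30)
# First delays: 1000, 2000, 4000, 8000, 16000, 32000, then capped at 32000.
```

With the example values, the delay reaches the 32-second cap on the sixth attempt and stays there for the remaining attempts.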

SSE server setup

The Sidecar SSE server is used to connect to Casper nodes, listen to events from them, store them locally and re-broadcast them to clients. Here is a sample configuration for the SSE server:

[sse_server]
enable_server = true
emulate_legacy_sse_apis = ["V1"]

[[sse_server.connections]]
 <Described later in the document>

[sse_server.event_stream_server]
 <Described later in the document>

sse_server.enable_server - If set to true, the SSE server will be enabled.
sse_server.emulate_legacy_sse_apis - A list of legacy Casper node SSE APIs to emulate. The Sidecar will expose SSE endpoints that are compatible with specified versions. Please bear in mind that this feature is an emulation and should be used only for transition periods. In most scenarios, having a 1-to-1 mapping of new messages into old formats is impossible, so this can be a process that loses some data and/or doesn't emit all messages that come from the Casper node. See the Legacy SSE Emulation page for more details.

Configuring SSE node connections

The Sidecar's SSE component can connect to Casper nodes' SSE endpoints with versions greater than or equal to 2.0.0.

The sse_server.connections option configures the node (or multiple nodes) to which the Sidecar will connect and the parameters under which it will operate with that node. Connecting to multiple nodes requires multiple [[sse_server.connections]] sections.

[sse_server]
enable_server = true

[[sse_server.connections]]
ip_address = "127.0.0.1"
sse_port = 18101
rest_port = 14101
max_attempts = 10
delay_between_retries_in_seconds = 5
allow_partial_connection = false
enable_logging = true
connection_timeout_in_seconds = 3
no_message_timeout_in_seconds = 60
sleep_between_keep_alive_checks_in_seconds = 30

[[sse_server.connections]]
ip_address = "127.0.0.1"
sse_port = 18102
rest_port = 14102
max_attempts = 10
delay_between_retries_in_seconds = 5
allow_partial_connection = false
enable_logging = false
connection_timeout_in_seconds = 3
no_message_timeout_in_seconds = 60
sleep_between_keep_alive_checks_in_seconds = 30

[[sse_server.connections]]
ip_address = "127.0.0.1"
sse_port = 18103
rest_port = 14103
max_attempts = 10
delay_between_retries_in_seconds = 5
allow_partial_connection = false
enable_logging = false
connection_timeout_in_seconds = 3
no_message_timeout_in_seconds = 60
sleep_between_keep_alive_checks_in_seconds = 30

ip_address - The IP address of the node to monitor.
sse_port - The node's event stream (SSE) port. This example configuration uses ports 18101, 18102, and 18103.
rest_port - The node's REST endpoint for status and metrics. This example configuration uses ports 14101, 14102, and 14103.
max_attempts - The maximum number of attempts the Sidecar will make to connect to the node. If set to 0, the Sidecar will not attempt to connect.
delay_between_retries_in_seconds - The delay between attempts to connect to the node.
allow_partial_connection - Determines whether the Sidecar will allow a partial connection to this node.
enable_logging - This enables the logging of events from the node in question.
connection_timeout_in_seconds - Number of seconds before the connection request times out. This parameter is optional, and defaults to 5.
no_message_timeout_in_seconds - Number of seconds after which the connection will be restarted if no bytes were received. This parameter is optional, and defaults to 120.
sleep_between_keep_alive_checks_in_seconds - Optional parameter specifying the time intervals (in seconds) for checking if the connection is still alive. Defaults to 60.

Configuring SSE legacy emulations

Applications using version 1 of a Casper node's event stream server can still function using an emulated V1 SSE API for a limited time. Enabling the V1 SSE API emulation requires the emulate_legacy_sse_apis setting to be ["V1"]:

[sse_server]
enable_server = true
emulate_legacy_sse_apis = ["V1"]

This setting will expose three legacy SSE endpoints with the following events streamed on each endpoint:

/events/sigs - Finality Signature events
/events/deploys - DeployAccepted events
/events/main - All other legacy events, including BlockAdded, DeployProcessed, DeployExpired, Fault, Step, and Shutdown events

See the Legacy SSE Emulation page for more details.
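
The endpoint split listed above can be expressed as a small routing function. This is a sketch of the mapping only. `legacy_endpoint` is a hypothetical name, and the `FinalitySignature` identifier spelling is an assumption.

```python
def legacy_endpoint(event_type: str) -> str:
    """Return the legacy V1 endpoint on which an event type would be streamed."""
    if event_type == "FinalitySignature":  # assumed identifier spelling
        return "/events/sigs"
    if event_type == "DeployAccepted":
        return "/events/deploys"
    # BlockAdded, DeployProcessed, DeployExpired, Fault, Step, Shutdown, ...
    return "/events/main"
```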

Configuring the event stream

To configure the Sidecar's event stream server, specify the following settings:

[sse_server.event_stream_server]
port = 19999
max_concurrent_subscribers = 100
event_stream_buffer_length = 5000

event_stream_server.port - The port under which the Sidecar's SSE server publishes events.
event_stream_server.max_concurrent_subscribers - The maximum number of subscribers that can monitor the Sidecar's event stream.
event_stream_server.event_stream_buffer_length - The number of events that the stream will hold in its buffer for reference when a subscriber reconnects.
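
To illustrate how a bounded buffer supports replay for reconnecting subscribers, here is a minimal sketch based on a fixed-length deque. `EventBuffer` is a hypothetical class, not the Sidecar's implementation, and treating the start ID as inclusive is an assumption.

```python
from collections import deque
from typing import Deque, List, Tuple


class EventBuffer:
    """Keep only the most recent `length` events, discarding the oldest."""

    def __init__(self, length: int) -> None:
        self._events: Deque[Tuple[int, str]] = deque(maxlen=length)

    def push(self, event_id: int, payload: str) -> None:
        # When the deque is full, appending silently drops the oldest event.
        self._events.append((event_id, payload))

    def replay_from(self, start_from: int) -> List[Tuple[int, str]]:
        """Return buffered events whose ID is at least `start_from`."""
        return [event for event in self._events if event[0] >= start_from]


buffer = EventBuffer(length=3)
for i in range(1, 6):
    buffer.push(i, f"event-{i}")
# Only events 3, 4, and 5 remain; events 1 and 2 were evicted.
```

A larger event_stream_buffer_length lets subscribers catch up from further back, at the cost of holding more events in memory.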

REST server setup

The following section configures the Sidecar's REST server, including the limits applied to incoming requests.

[rest_api_server]
enable_server = true
port = 18888
max_concurrent_requests = 50
max_requests_per_second = 50
request_timeout_in_seconds = 10

enable_server - If set to true, the REST API server will be enabled.
port - The port for accessing the Sidecar's REST server. 18888 is the default, but operators are free to choose their own port as needed.
max_concurrent_requests - The maximum total number of simultaneous requests that can be made to the REST server.
max_requests_per_second - The maximum total number of requests that can be made per second.
request_timeout_in_seconds - The total time before a request times out.

Storage setup

The storage_path setting specifies the directory that stores the SSE cache and the SQLite database, if the Sidecar is configured to use SQLite.

[storage]
storage_path = "./target/storage"

Database connectivity setup

The Sidecar can connect to different types of databases. The current options are SQLite or PostgreSQL. The following sections show how to configure the database connection. Note that the Sidecar can only connect to one database at a time.

SQLite database

This section includes configurations for the SQLite database.

[storage.sqlite_config]
file_name = "sqlite_database.db3"
max_connections_in_pool = 100
wal_autocheckpointing_interval = 1000

storage.sqlite_config.file_name - The database file path.
storage.sqlite_config.max_connections_in_pool - The maximum number of connections to the database (should generally be left as is).
storage.sqlite_config.wal_autocheckpointing_interval - This controls how often the system commits pages to the database. The value determines the maximum number of pages before forcing a commit. More information can be found in the SQLite documentation on write-ahead logging.

PostgreSQL database

The properties listed below are elements of the PostgreSQL database connection that can be configured for the Sidecar.

storage.postgresql_config.database_name - Name of the database.
storage.postgresql_config.host - URL to PostgreSQL instance.
storage.postgresql_config.database_username - Username.
storage.postgresql_config.database_password - Database password.
storage.postgresql_config.max_connections_in_pool - The maximum number of connections to the database.
storage.postgresql_config.port - The port for the database connection.

To run the Sidecar with PostgreSQL, you can set the following database environment variables to control how the Sidecar connects to the database. This is the suggested method to set the connection information for the PostgreSQL database.

SIDECAR_POSTGRES_USERNAME="your username"

SIDECAR_POSTGRES_PASSWORD="your password"

SIDECAR_POSTGRES_DATABASE_NAME="your database name"

SIDECAR_POSTGRES_HOST="your host"

SIDECAR_POSTGRES_MAX_CONNECTIONS="max connections"

SIDECAR_POSTGRES_PORT="port"

However, DB connectivity can also be configured using the Sidecar configuration file. If the DB environment variables and the Sidecar's configuration file have the same variable set, the DB environment variables will take precedence.

It is possible to completely omit the PostgreSQL configuration from the Sidecar's configuration file. In this case, the Sidecar will attempt to connect to PostgreSQL using the database environment variables, falling back to default values for non-critical variables.

[storage.postgresql_config]
database_name = "event_sidecar"
host = "localhost"
database_password = "p@$$w0rd"
database_username = "postgres"
max_connections_in_pool = 30
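
The precedence rule described above (environment variables override the configuration file) can be sketched as follows. `resolve_postgres_settings` is a hypothetical helper, and the pairing of each environment variable with a configuration key is inferred from the names. Values taken from the environment are left as strings in this sketch.

```python
import os
from typing import Mapping, Optional

# Assumed pairing of environment variables and configuration keys.
ENV_TO_KEY = {
    "SIDECAR_POSTGRES_USERNAME": "database_username",
    "SIDECAR_POSTGRES_PASSWORD": "database_password",
    "SIDECAR_POSTGRES_DATABASE_NAME": "database_name",
    "SIDECAR_POSTGRES_HOST": "host",
    "SIDECAR_POSTGRES_MAX_CONNECTIONS": "max_connections_in_pool",
    "SIDECAR_POSTGRES_PORT": "port",
}


def resolve_postgres_settings(
    file_config: Mapping[str, object],
    env: Optional[Mapping[str, str]] = None,
) -> dict:
    """Merge file settings with environment variables; the environment wins."""
    env = os.environ if env is None else env
    resolved = dict(file_config)
    for variable, key in ENV_TO_KEY.items():
        if variable in env:
            resolved[key] = env[variable]
    return resolved
```

For example, a file value of host = "localhost" combined with SIDECAR_POSTGRES_HOST set to another host resolves to the environment value.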

Admin server setup

This optional section configures the Sidecar's administrative server. If this section is not specified, the Sidecar will not start an admin server.

[admin_api_server]
enable_server = true
port = 18887
max_concurrent_requests = 1
max_requests_per_second = 1

enable_server - If set to true, the admin API server will be enabled.
port - The port for accessing the Sidecar's admin server.
max_concurrent_requests - The maximum total number of simultaneous requests that can be sent to the admin server.
max_requests_per_second - The maximum total number of requests that can be sent per second to the admin server.

Access the admin server at http://localhost:18887/metrics/.

Running and Testing the Sidecar

Prerequisites

To compile, test, and run the Sidecar, install the following software first:

CMake 3.1.4 or greater
Rust
pkg-config
gcc
g++

Running the Sidecar

After creating the configuration file, run the Sidecar using cargo and point to the configuration file using the --path-to-config option, as shown below. The command needs to run with root privileges.

sudo cargo run -- --path-to-config ./resources/example_configs/EXAMPLE_NODE_CONFIG.toml

The Sidecar application leverages tracing, which can be controlled by setting the RUST_LOG environment variable.

The following command will run the Sidecar application with the INFO log level.

RUST_LOG=info cargo run -p casper-sidecar -- --path-to-config ./resources/example_configs/EXAMPLE_NCTL_CONFIG.toml

The log levels, listed in order of increasing verbosity, are:

ERROR
WARN
INFO
DEBUG
TRACE

Further details about log levels can be found here.

Testing the Sidecar

You can run the unit and integration tests included in this repository with the following command:

cargo test

You can also run the performance tests using this command:

cargo test -- --include-ignored

The EXAMPLE_NCTL_CONFIG.toml file contains the configurations used for these tests.

Testing the Sidecar using NCTL

The Sidecar application can be tested against live Casper nodes or a local NCTL network.

The EXAMPLE_NCTL_CONFIG.toml configuration will direct the Sidecar application to a locally hosted NCTL network if one is running. The Sidecar should function the same way it would while connected to a live node, displaying events as they occur in the local NCTL network.

Swagger Documentation

Once the Sidecar is running, access the Swagger documentation at http://localhost:18888/swagger-ui/. You need to replace localhost with the IP address of the machine running the Sidecar application if you are running the Sidecar remotely. The Swagger documentation will allow you to test the REST API.

OpenAPI Specification

An OpenAPI schema is available at http://localhost:18888/api-doc.json/. You need to replace localhost with the IP address of the machine running the Sidecar application if you are running the Sidecar remotely.

Troubleshooting Tips

This section covers helpful tips when troubleshooting the Sidecar service. Replace the URL and ports provided in the examples as appropriate.

Checking liveness

To check whether the Sidecar is running, run the following curl command, which returns the newest stored block.

curl http://SIDECAR_URL:SIDECAR_REST_PORT/block

Each block should have a .block.header.timestamp field. Even if there were no deploys, a block should be produced every 30-60 seconds. If the latest block falls behind, there is an issue with the Sidecar reading events from the node. If jq is installed, the following command extracts the timestamp:

curl http://SIDECAR_URL:SIDECAR_REST_PORT/block | jq '.block.header.timestamp'
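
The same check can be automated. The sketch below computes the age of the newest stored block from an already-parsed /block response. It assumes the timestamp is an ISO 8601 UTC string such as "2024-06-27T12:00:00.000Z". The function names and the 120-second threshold are illustrative choices, not Sidecar defaults.

```python
from datetime import datetime, timezone


def block_age_seconds(response: dict, now: datetime) -> float:
    """Return how many seconds old the block in the response is."""
    raw = response["block"]["header"]["timestamp"]
    # Older Python versions do not accept a trailing "Z", so normalize it.
    timestamp = datetime.fromisoformat(raw.replace("Z", "+00:00"))
    return (now - timestamp).total_seconds()


def is_stale(response: dict, now: datetime, max_age_seconds: float = 120.0) -> bool:
    """Flag the Sidecar as lagging when the newest block is too old."""
    return block_age_seconds(response, now) > max_age_seconds


# Usage with a hand-written response standing in for the curl output.
sample = {"block": {"header": {"timestamp": "2024-06-27T12:00:00.000Z"}}}
lagging = is_stale(sample, datetime(2024, 6, 27, 12, 5, tzinfo=timezone.utc))
```

Here `lagging` is true because the sample block is five minutes old, well beyond the expected 30-60 second block interval.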

Checking the node connection

Checking the node connection status requires the admin server to be enabled, as described in Admin server setup. Use this curl command and observe the output:

curl http://SIDECAR_URL:SIDECAR_ADMIN_PORT/metrics

Sample output:

# HELP node_statuses Current status of node to which the Sidecar is connected. Numbers mean: 0 - preparing; 1 - connecting; 2 - connected; 3 - reconnecting; -1 - connections_exhausted -> used up all connection attempts ; -2 - incompatible -> node is in an incompatible version
# TYPE node_statuses gauge
node_statuses{node="35.180.42.211:9999"} 2
node_statuses{node="69.197.42.27:9999"} 2

In the above node_statuses, you can see which nodes are connecting, which are already connected, which are disconnected due to no more retries, etc. The number next to each node represents the connection status:

0 - The Sidecar is preparing to connect
1 - The Sidecar is connecting to the node
2 - The Sidecar is connected to this node
3 - The Sidecar is reconnecting
-1 - The Sidecar is not connected and has reached the maximum connection attempts
-2 - The Sidecar is not connected due to an incompatible node version
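
For scripted monitoring, the node_statuses lines can be extracted from the metrics text and translated using the table above. This is a minimal sketch: `parse_node_statuses` is a hypothetical helper, and the regular expression only handles integer gauge values formatted like the sample output.

```python
import re
from typing import Dict

STATUS_LABELS = {
    0: "preparing",
    1: "connecting",
    2: "connected",
    3: "reconnecting",
    -1: "connections_exhausted",
    -2: "incompatible",
}

_STATUS_LINE = re.compile(r'^node_statuses\{node="([^"]+)"\}\s+(-?\d+)$')


def parse_node_statuses(metrics_text: str) -> Dict[str, str]:
    """Map each node address to a human-readable connection status."""
    statuses = {}
    for line in metrics_text.splitlines():
        match = _STATUS_LINE.match(line.strip())
        if match:
            code = int(match.group(2))
            statuses[match.group(1)] = STATUS_LABELS.get(code, "unknown")
    return statuses
```

Passing the body returned by the /metrics endpoint to `parse_node_statuses` yields a dictionary such as {"35.180.42.211:9999": "connected"} for the sample output above.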

Diagnosing errors

To diagnose errors, look for error logs and check the error_counts on the metrics page, http://SIDECAR_URL:SIDECAR_ADMIN_PORT/metrics, where most of the errors related to data flow will be stored:

# HELP error_counts Error counts
# TYPE error_counts counter
error_counts{category="connection_manager",description="fetching_from_stream_failed"} 6

Monitoring memory consumption

To monitor the Sidecar's memory consumption, observe the metrics page, http://SIDECAR_URL:SIDECAR_ADMIN_PORT/metrics. Search for process_resident_memory_bytes:

# HELP process_resident_memory_bytes Resident memory size in bytes.
# TYPE process_resident_memory_bytes gauge
process_resident_memory_bytes 292110336

If memory consumption is high without an apparent reason, please inform the Sidecar team by creating an issue in GitHub.

Remember to check the event_stream_buffer_length setting in the configuration because it dramatically impacts how much memory the Sidecar consumes. Also, some events, like step events, consume more memory.

Ensuring sufficient storage

Ensuring enough space in the database is essential for the Sidecar to consume events produced from the nodes' SSE streams over an extended period. Each event is written to the database in a raw format for future processing. Running the Sidecar for weeks or months can result in storing multiple gigabytes of data. If the database runs out of space, the Sidecar will lose events, as it cannot record them.

Inspecting the REST API

The easiest way to inspect the Sidecar’s REST API is with Swagger.

Limiting concurrent requests

The Sidecar can be configured to limit concurrent requests (max_concurrent_requests) and requests per second (max_requests_per_second) for the REST and admin servers.

However, remember that these are application-level guards: by the time they apply, the operating system has already accepted the connection and spent resources on it. Mitigating potential DDoS attacks requires protections placed in front of the Sidecar application, before requests reach it.

License

Licensed under the Apache License Version 2.0.
