Continuous Assurance - Dashboard

The Continuous Assurance Dashboard is a stand-alone, Flask-based Python application that serves pre-defined security reporting. All that's missing is your data and your metrics.

Once you have your data, a single API call to the app will index it, trend it over time, and serve it straight away.

Getting started

The app is built on Python 3.13; older Python 3 versions may also work.

Clone the repo

  • git clone https://github.com/massyn/cyber-dashboard-flask
  • cd cyber-dashboard-flask

Install Python requirements

  • pip install -r requirements.txt

Run the app

  • python app.py

Open the app

Open http://localhost:8080 in your browser and log on with the default credentials (user / password123).

At this point the dashboard will appear, but it will be completely empty. Let's add a metric to it.

Add a metric to the dashboard

We will add some data using a simple vulnerability management metric that indicates whether a server is compliant (1) or not (0) against some arbitrary criteria. Each server also belongs to a particular business unit, so we prepare the following data.

metric_id  resource  compliance  business_unit
vm-99      ServerA   1           Sales
vm-99      ServerB   0           Sales
vm-99      ServerC   1           Sales
vm-99      ServerD   1           Sales
vm-99      ServerE   0           Marketing
vm-99      ServerF   0           Marketing
vm-99      ServerG   1           Marketing
vm-99      ServerH   0           Marketing
vm-99      ServerI   1           Marketing
vm-99      ServerJ   1           Marketing

Prepare a CSV blob of the data you'd like to ingest. The following curl command can be used to simulate loading this data via the API.

    curl -X POST http://localhost:8080/api \
    -H "Authorization: Bearer 6002920168C3253430A653E16AD36EE88F6E3C7D917A5F245F735D96ABDA67FE" \
    -H "Content-Type: text/csv" \
    --data-binary $'metric_id,resource,compliance,business_unit\nvm-99,ServerA,1,Sales\nvm-99,ServerB,0,Sales\nvm-99,ServerC,1,Sales\nvm-99,ServerD,1,Sales\nvm-99,ServerE,0,Marketing\nvm-99,ServerF,0,Marketing\nvm-99,ServerG,1,Marketing\nvm-99,ServerH,0,Marketing\nvm-99,ServerI,1,Marketing\nvm-99,ServerJ,1,Marketing\n'
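
The same upload can also be made from Python. Below is a minimal sketch using the third-party requests library, assuming the app is running locally on port 8080 with the default token shown above; adjust the host, token and payload for your environment.

    import requests

    # A small CSV payload in the same format as the curl example above
    csv_data = (
        "metric_id,resource,compliance,business_unit\n"
        "vm-99,ServerA,1,Sales\n"
        "vm-99,ServerB,0,Sales\n"
    )

    response = requests.post(
        "http://localhost:8080/api",
        headers={
            "Authorization": "Bearer 6002920168C3253430A653E16AD36EE88F6E3C7D917A5F245F735D96ABDA67FE",
            "Content-Type": "text/csv",
        },
        data=csv_data,
    )
    print(response.json())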

On success, you should see a message like this:

{"message":"Uploaded 10 records","result":[{"metric_id":"vm-99","score":0.6,"total":10,"totalok":6}],"success":true}

This tells us that 10 records were uploaded successfully and that the system calculated the compliance percentage for us: 6 of the 10 servers are compliant, so the score comes in at 6 / 10 = 60%.

Go back and open the dashboard again. The data you just loaded is now rendered on the dashboard, and the filters have been added, so you can switch from one business unit to another and see their respective scores.

Deeper dive - what happened in the background?

The /api call serves one purpose only: loading data into the dashboard. It accepts only CSV data. The CSV format supports the following fields.

csv format

Field       Type    Mandatory  Default    Description
datestamp   string  No         Today      Date of the record using format YYYY-MM-DD
metric_id   string  Yes                   Unique identifier for the metric
title       string  No         metric_id  Title of the metric
category    string  No         undefined  Category (or security domain) of the metric
slo         float   No         0.95       Service level objective percentage between 0 and 1
slo_min     float   No         0.90       Minimum service level objective percentage between 0 and 1
weight      float   No         0.90       Weight of the metric
resource    string  Yes                   Unique identifier for the resource
compliance  float   No         0          Compliance value between 0 and 1
detail      string  No         blank      Additional information to help with remediation
*           string  No         undefined  Additional dimensions to be used if required
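
For illustration, a record that supplies some of the optional fields could look like this (the values are purely hypothetical):

    metric_id,resource,compliance,datestamp,title,category,slo,slo_min,weight,detail,business_unit
    vm-99,ServerB,0,2025-01-31,Servers patched within SLA,Vulnerability Management,0.95,0.90,0.9,Patch overdue by 14 days,Sales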

Sanitisation

The sanitisation function reads the provided CSV data, fills any gaps with default values, and fails the upload where mandatory data is either not provided or does not meet the required format.
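
Conceptually, the sanitisation step behaves something like the sketch below (using pandas; this is illustrative, not the dashboard's actual code):

    import datetime
    import pandas as pd

    def sanitise(df: pd.DataFrame) -> pd.DataFrame:
        # Fail the upload if a mandatory field is missing
        for field in ["metric_id", "resource"]:
            if field not in df.columns or df[field].isna().any():
                raise ValueError(f"missing mandatory field: {field}")
        # Fill gaps with the documented defaults
        if "datestamp" not in df.columns:
            df["datestamp"] = datetime.date.today().isoformat()
        if "title" not in df.columns:
            df["title"] = df["metric_id"]
        defaults = {"category": "undefined", "slo": 0.95, "slo_min": 0.90,
                    "weight": 0.90, "compliance": 0.0, "detail": ""}
        for field, default in defaults.items():
            if field not in df.columns:
                df[field] = default
            else:
                df[field] = df[field].fillna(default)
        return df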

Store the detail

Data is stored in the parquet format. When saving the detail, only the latest load per metric is retained: the store function deletes the existing records for the metric id provided in the API call and replaces them with the sanitised data. That way the detail file always contains the latest copy of each metric's detail.
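
A rough sketch of that delete-and-replace behaviour, assuming pandas and a local parquet file (illustrative only; the real code and file path may differ):

    import os
    import pandas as pd

    def store_detail(new_records: pd.DataFrame, path: str = "data/detail.parquet") -> None:
        if os.path.exists(path):
            detail = pd.read_parquet(path)
            # Drop any previous load for the metric(s) being uploaded...
            detail = detail[~detail["metric_id"].isin(new_records["metric_id"].unique())]
            detail = pd.concat([detail, new_records], ignore_index=True)
        else:
            detail = new_records
        # ...so the detail file only ever holds the latest load per metric
        detail.to_parquet(path, index=False)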

Summarise the data

The data is summarised by counting the number of resources for each metric and summing their compliance values, grouped by the dimensions, much like a pivot table.

In addition to summarising the values, the data is also stored as a time series; this table is used to calculate trends over time.

Field       Type    Description
datestamp   string  Date of the record using format YYYY-MM-DD
metric_id   string  Unique identifier for the metric
title       string  Title of the metric
category    string  Category (or security domain) of the metric
slo         float   Service level objective percentage between 0 and 1
slo_min     float   Minimum service level objective percentage between 0 and 1
weight      float   Weight of the metric
totalok     float   Sum of compliance
total       float   Count of compliance
*           string  Additional dimensions to be used if required
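
As a rough sketch, the summary could be produced with a pandas group-by along these lines (illustrative, not the actual implementation):

    import pandas as pd

    def summarise(detail: pd.DataFrame, dimensions: list) -> pd.DataFrame:
        keys = ["datestamp", "metric_id", "title", "category",
                "slo", "slo_min", "weight"] + dimensions
        return (
            detail.groupby(keys, dropna=False)["compliance"]
            .agg(total="count", totalok="sum")
            .reset_index()
        )

    # On the sample data above, summarise(detail, ["business_unit"]) yields
    # total=4, totalok=3 for Sales and total=6, totalok=3 for Marketing.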

About the dashboard

Aggregation

Aggregation is the process of merging metrics together to allow for a consolidated view. Think of it as a way of averaging the metrics so as to see the overall impact. Aggregation is useful for scenarios where the individual metric is not of importance, but rather the net result of several of them grouped together.

The dashboard does two types of aggregation. The first is on the first dimension (as specified in the config.yml file), showing a breakdown of the latest metric data load aggregated by that dimension. The second is an aggregation by the category column (as specified during the data load process).

Aggregation is based on a weighted-average approach. Rather than simply averaging all metrics, you can adjust the weight of each metric during data load. If one metric is more important than another, increasing its weight gives its score a bigger influence over the aggregated score.
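
As an example of how the weighted average plays out (assuming the standard weighted-mean formula; the exact implementation may differ):

    def weighted_score(metrics):
        # Each metric contributes its score (totalok / total), scaled by its weight
        weighted = sum(m["weight"] * m["totalok"] / m["total"] for m in metrics)
        return weighted / sum(m["weight"] for m in metrics)

    # A metric scoring 0.6 with weight 0.9 and one scoring 1.0 with weight 0.1
    # aggregate to (0.9 * 0.6 + 0.1 * 1.0) / (0.9 + 0.1) = 0.64 - the heavier
    # metric dominates the result.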

Attribution

Attribution is the process of assigning resources to a dimension, like a business unit, team or location. It allows the dashboard to filter the metrics by that particular dimension to provide better insight into where a particular issue may exist.

Dimensions can be customised. Update the config.yml file with any additional dimensions you'd like to introduce in the schema. Be sure to update your API calls to the dashboard to include the dimensions in your data load.

config.yml

The config.yml file describes the basic behaviour of the cyber dashboard.

dimensions

Defines the dimension fields used to categorize data on the dashboard. These dimensions are tied to specific column names in the CSV or data files. These fields can be customised if you want to load additional data. The following fields are defined.

  • business_unit: The label for the "Business Unit" dimension. This field maps to the business_unit column in the dataset and represents different business units within the organization.
  • team: The label for the "Team" dimension. This field corresponds to the team column and represents various teams within the business.
  • location: The label for the "Location" dimension. This maps to the location column in the data and identifies where the team or business unit is physically located.

tokens

An array of bearer tokens that are used for authentication purposes. These tokens should be updated before going live to ensure security.

  • Example:
    tokens:
      - 6002920168C3253430A653E16AD36EE88F6E3C7D917A5F245F735D96ABDA67FE
    • Note: Make sure to replace these tokens before the application goes live.

secret_key

A secret key used for session token encryption. This key should be kept secure and changed before going live to prevent unauthorized access to session data.

  • Example:
    secret_key: FB7E8E6841E29B45B1158029C9606D43FA8083704EBA83EC620FAD2373BDAEBE
    • Note: This key is critical for the security of your application. Ensure it is changed before deployment.

data

Defines the paths to the data files used in the dashboard.

  • detail: Specifies the location of the detailed data file (detail.parquet). This file is used for providing detailed insights into the operational data.
  • summary: Specifies the location of the summary data file (summary.parquet). This file aggregates the data for a high-level overview on the dashboard.

Example:

detail: ../data/detail.parquet
summary: ../data/summary.parquet

RAG (Red-Amber-Green Color Scheme)

This section defines the color scheme for the dashboard's Red-Amber-Green (RAG) indicators. You can tweak these colors to match your visual preference.

  • red: The colors for the "red" state, typically indicating critical or high-risk conditions. The first color represents the background, and the second represents the text color.
  • amber: The colors for the "amber" state, representing moderate or cautionary conditions.
  • green: The colors for the "green" state, representing safe or normal conditions.

Example:

RAG:
  red:   ['#C00000', '#000000']
  amber: ['#FFC000', '#FFFFFF']
  green: ['#00B050', '#000000']

Security Considerations:

  • Tokens and secret keys must be kept confidential and secure. Do not hard-code them in public repositories.
  • Review the RAG color scheme and adjust the colors for visual clarity, especially if accessibility is a concern.
  • This configuration file is crucial for setting up and securing the Flask-based dashboard. Ensure that all fields are properly configured and that sensitive data is handled securely before going live.

Production deployment

While the app.py script can be executed locally, you should not run it this way on a production system or expose it to the internet. Instead, being a Flask application, the dashboard should be deployed with either Gunicorn or Waitress.

You should also deploy something like an Nginx Reverse Proxy to provide an additional layer of isolation between your app and the internet.

With Nginx, you also have the option of restricting access using basic authentication or hooking it up to an IdP like Okta.

Overwriting the config.yml attributes

Some attributes in the config.yml file can be overridden with environment variables. This is handy when you want to test something without having to change the configuration file.

The following variables are supported.

Config   Environment variable
title    DASHBOARD_TITLE
summary  DASHBOARD_SUMMARY
detail   DASHBOARD_DETAIL
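
For example, on Linux or macOS you could start the app with a different title without touching config.yml:

    DASHBOARD_TITLE="My Security Dashboard" python app.py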

Additional tips for success

  • The compliance field is a float, a percentage of compliance if you will. That means that when you determine compliance for a particular resource, you can in fact record partial compliance (for example 0.5).
  • To back up the dashboard, back up the data folder and config.yml. To restore, simply install a fresh instance and put back the data folder and config.yml files.
  • You can start the dashboard with a custom config file by passing the -config parameter to app.py, for example:
python app.py -config my_other_config.yml

Loading a data file locally

One particular use case I had was to load data into the parquet files without spinning up the entire dashboard. The api.py script allows you to feed in a CSV file directly.

  • Prepare a csv file using the same data format as described above.
  • Run python api.py -load csv_data.csv

The load process supports .csv, .json or .parquet files, detected by file extension, provided they follow the same data load schema.

The parquet files will be updated as per normal, and any subsequent start of the dashboard will read the data in the same way.

Docker

See the cyber-metrics-docker project for how to run this dashboard as a Docker container.
