Skip to content

Latest commit

 

History

History
68 lines (36 loc) · 3.5 KB

README.md

File metadata and controls

68 lines (36 loc) · 3.5 KB

EDA - Data Consistency and Replication

This repository includes a set of documents for best practices around data replication between two Kafka clusters and data consistency practices.

Better to read in BOOK format.

Requirements

Mirror Maker 2.0

Local cluster to Event Streams on Cloud

The goal is to demonstrate the replicate data from local cluster to Event Streams on IBM Cloud running as managed service. The two scenarios and the step by step approach are presented in this note.

We have also documented the replication approaches from Event Streams as a Service to local cluster in this note.

Provisioning Connectors (Mirror Maker 2)

This main epic is related to provisioning operation.

  1. As a SRE I want to provision and deploy Mirror Maker 2 connector to existing Openshift cluster without exposing password and keys so replication can start working.

    • This will use Kubernetes secrets for configuration parameters to avoid exposing sensitive data. We describe the approach in this section
  2. As a SRE I want to understand the CLI commands used to assess how the provisioning process can be automated. See the note here

  3. As a SRE I want to understand the server sizing for the Mirror Maker environment.

Note that, there is no specific user interface for mirror maker connector.

Version to version migration

  1. As a SRE, I want to understand how to perform a version to version migration for the Mirror Maker 2 product so that existing streaming replication is not impacted by the upgrade.

  2. As a developer I want to deploy configuration updates to modify the topic white or black lists so that newly added topics are replicated.

Security

  1. As a SRE, I want to understand how the security support to connect client applications to cluster and to replicated topic.

  2. As a developer I want to design Mirror Maker 2 based replication solution to support different line of businesses who should not connect to topics and data not related to their business and security scope.

Monitoring

  1. As a SRE, I want to get Mirror Maker 2 metrics for Prometheus so that it fits in my current metrics processing practices. The explanation to setup Prometheus metris for mirror maker 2.0 is documented here

  2. As a SRE, I want to be able to add new dashboard into Grafana to visualize the Mirror Maker 2 metrics.

  3. As a SRE, I want to define rules for alert reporting and configure a Slack channel for alerting.

  4. As a SRE, I want to get the Mirror Maker 2 logs into our Splunk logging platform.

Best Practices

  1. As a developer I want to understand how Mirror Maker 2 based replication address the record duplication.

  2. As a developer I want to design replication solution to minimize the instance of Mirror Maker or being able to scale them if I observe lag into data replication processing.

  3. As a developer I want to understand what are the condition for message loss.

Performance tests

  1. As a developer I want to understand how to measure data latency and lag in data replication.

  2. As a SRE I want to understand current thoughput for the replication solution.

  • Ensure message affinity