to_do_list

A distributed to-do list that keeps a replicated to-do list on a fault-tolerant distributed system. The system stores the to-do list on up to 3 replicated servers. This project was made for the course 18-749 Building Reliable Distributed Systems in Summer 2020.

This involved the design and implementation of an industry-grade fault-tolerant distributed asynchronous system, with heartbeats, distributed consensus, total ordering, checkpointing, and logging to provide strong consistency for a distributed replicated application. This project involved supporting different replication styles (active, or hot-swap replication, as well as passive, or primary-backup replication), along with mechanisms to ensure no downtime even as crash faults are injected.
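To illustrate how heartbeats can expose crash faults, here is a minimal sketch of a timeout-based detector. It is not taken from this project's code: the class name `HeartbeatMonitor`, its methods, and the replica IDs are hypothetical, and the rule "suspect a replica once no heartbeat has arrived within the timeout" is an assumption about how such a detector typically works.

```python
import time
from typing import Callable, Dict, List


class HeartbeatMonitor:
    """Hypothetical detector: tracks the last heartbeat seen from each replica."""

    def __init__(self, timeout: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.timeout = timeout            # seconds of silence before suspicion
        self.clock = clock                # injectable clock, handy for testing
        self.last_seen: Dict[str, float] = {}

    def record_heartbeat(self, replica_id: str) -> None:
        """Remember the time at which this replica last reported in."""
        self.last_seen[replica_id] = self.clock()

    def suspected(self) -> List[str]:
        """Return the replicas whose last heartbeat is older than the timeout."""
        now = self.clock()
        return sorted(
            replica_id
            for replica_id, seen_at in self.last_seen.items()
            if now - seen_at > self.timeout
        )
```

A caller would invoke `record_heartbeat` whenever a heartbeat reply arrives and poll `suspected()` periodically. Passing the clock in as a parameter keeps the timeout logic testable without real waiting.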

The system has a total of 5 applications:

- Server
- Client
- Replication Manager
- Local Fault Detector (LFD)
- Factory

All these applications work together to provide the consistency guarantees listed above in the face of faults.

At most, the system supports 3 active or passive replicas, which involves a total of 13 asynchronous processes running across 4 different machines: 3 clients, 3 servers, 3 LFDs, 3 factories, and a global replication manager.

All these software components communicate by exchanging messages over TCP/IP.
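Because TCP delivers a byte stream rather than discrete messages, message-based components need some way to mark message boundaries. The sketch below shows one common approach, a 4-byte length prefix. The wire format this project actually uses is not documented here, so the framing scheme and the function names `send_message`, `recv_exact`, and `recv_message` are assumptions made for illustration only.

```python
import socket
import struct

# Assumed framing: a 4-byte big-endian length followed by the payload bytes.
HEADER = struct.Struct("!I")


def send_message(sock: socket.socket, payload: bytes) -> None:
    """Send one length-prefixed message."""
    sock.sendall(HEADER.pack(len(payload)) + payload)


def recv_exact(sock: socket.socket, count: int) -> bytes:
    """Read exactly `count` bytes, looping because recv may return fewer."""
    buffer = bytearray()
    while len(buffer) < count:
        chunk = sock.recv(count - len(buffer))
        if not chunk:
            raise ConnectionError("peer closed the connection mid-message")
        buffer.extend(chunk)
    return bytes(buffer)


def recv_message(sock: socket.socket) -> bytes:
    """Receive one length-prefixed message."""
    (length,) = HEADER.unpack(recv_exact(sock, HEADER.size))
    return recv_exact(sock, length)
```

With this framing, two messages sent back to back are always read back as two separate messages, even if the network coalesces or splits the underlying bytes.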

To compile:

make format; make clean; make;

To start the server:

./server {IP} {Port}

To start the client:

./client -C {Client ID} -I {Heartbeat interval} -H {Server IP} -P {Port}

The heartbeat interval is optional. If no value is given, it defaults to 10 seconds.
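The client options above, including the optional interval with its 10-second default, can be sketched with Python's `argparse`. This only illustrates the documented flags and is not the project's own argument parsing; the function name `build_parser`, the destination names, and the choice to treat the interval and port as integers are assumptions.

```python
import argparse

# Default taken from the text above: 10 seconds when -I is omitted.
DEFAULT_HEARTBEAT_INTERVAL_SEC = 10


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the documented client flags (hypothetical)."""
    parser = argparse.ArgumentParser(prog="client")
    parser.add_argument("-C", dest="client_id", required=True,
                        help="client ID")
    parser.add_argument("-I", dest="interval", type=int,
                        default=DEFAULT_HEARTBEAT_INTERVAL_SEC,
                        help="heartbeat interval in seconds (optional)")
    parser.add_argument("-H", dest="server_ip", required=True,
                        help="server IP address")
    parser.add_argument("-P", dest="port", type=int, required=True,
                        help="server port")
    return parser
```

For example, `build_parser().parse_args(["-C", "1", "-H", "127.0.0.1", "-P", "8080"])` leaves `interval` at its default, while adding `-I 5` overrides it.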
