Fault-Tolerance in ERF
Home Feedback Contents Search
 

Home
Thesis
Motivation
Objetives
Organization
People
Infraestructure
Publications
Presentations
Downloads
Bugs
Links

Fault-Tolerance in an Event Rule Framework for Distributed Systems

by: Hillary Caituro-Monge

Abstract

In Distributed Systems (DSs) with Asynchronous, non-deterministic Reactive Components (ANDRCs), the timing assumptions are not valid and the output could be different, even if the same sequences of stimuli are input with the same initial state. In such systems, the availability and reliability are critical, so they must be fault tolerant and it can be achieved by replicating the ANDRCs. This research uses active replication with a middle-tier component that multicasts incoming-events to all replicas, and detects and suppresses duplicated outgoing-events that were posted by each replica; it also uses a semi-active replication approach where only one (“the leader”) is able to post events, while the others listen to the leader to keep consistency, having each replica detecting/suppressing duplicated outgoing-events. The Event-Rule Framework (ERF), which is a framework for developing DSs with ANDRCs, was used as a test bed for this research. The performance analysis shows linear execution-time curves in all test scenarios. Therefore, proposed solutions were proven to be feasible, and their performance results were proven to be acceptable.