Ledmi, A., Bendjenna, H., & Hemam, S. M. (2018). Fault tolerance in distributed systems: A survey [Conference presentation]. 2018 3rd International Conference on Pattern Analysis and Intelligent Systems (PAIS), 1–5.
This survey reviews how fault tolerance is managed in distributed systems.
Ozer, E., Venu, B., Iturbe, X., Das, S., Lyberis, S., Biggs, J., Harrod, P., & Penton, J. (2018). Error correlation prediction in lockstep processors for safety-critical systems. Microarchitecture, 737.
This paper demonstrates how parallel computing, in the form of processors running in lockstep, is used to build fault-tolerant, safety-critical systems.
Wachter, E. W., Kasap, S., Zhai, X., Ehsan, S., & McDonald-Maier, K. (2019). Survey of lockstep based mitigation techniques for soft errors in embedded systems [Conference presentation]. 2019 11th Computer Science and Electronic Engineering (CEEC), 124–127.
This paper reviews the lockstep mechanism across three levels of design abstraction: the processor design level, the architectural level, and the software level.