Reliability, Availability, and Serviceability (RAS) for Petascale High-End Computing and Beyond



Stephen L. Scott
is the lead principal investigator for the overall project and the institutional principal investigator at Oak Ridge National Laboratory. Stephen is a Senior Research Scientist and leader of the System Research Team of the Computer Science Research Group in the Computer Science and Mathematics Division at Oak Ridge National Laboratory.


Christian Engelmann
is responsible for the reactive fault tolerance and holistic fault tolerance efforts at Oak Ridge National Laboratory. Christian is a Research and Development Staff Member in the System Research Team of the Computer Science Research Group in the Computer Science and Mathematics Division at Oak Ridge National Laboratory. Christian also collaborates as a Research Assistant with the Centre for Advanced Computing and Emerging Technologies at the Department of Computer Science of the University of Reading, United Kingdom.


Hong H. Ong
is responsible for the reliability analysis and experiments effort at Oak Ridge National Laboratory. Hong is a Research and Development Staff Member in the System Research Team of the Computer Science Research Group in the Computer Science and Mathematics Division at Oak Ridge National Laboratory.


Geoffroy R. Vallée
is responsible for the proactive fault tolerance effort at Oak Ridge National Laboratory. Geoffroy is a Research and Development Staff Member in the System Research Team of the Computer Science Research Group in the Computer Science and Mathematics Division at Oak Ridge National Laboratory.


Chokchai (Box) Leangsuksun
is the institutional principal investigator at Louisiana Tech University. Box is an Associate Professor at the Center for Entrepreneurship and Information Technology in the Computer Science Department at Louisiana Tech University.


Mihaela Paun
is responsible for the reliability analysis effort at Louisiana Tech University. Mihaela is an Associate Professor in the Mathematics and Statistics Department at Louisiana Tech University.


Frank Mueller
is the institutional principal investigator at North Carolina State University. Frank is an Associate Professor in the Department of Computer Science at North Carolina State University.

 

This research is sponsored by the Office of Advanced Scientific Computing Research, Office of Science, U.S. Department of Energy. The work is performed jointly at Oak Ridge National Laboratory (managed by UT-Battelle, LLC under Contract No. DE-AC05-00OR22725), Louisiana Tech University, and North Carolina State University. Please contact engelmannc@ornl.gov with questions or comments regarding this page.