Veritas
Veritas provides a method for validating proxy applications to ensure that they capture the intended characteristics of their parents.
Application-Level Resilience
Application-level resilience is emerging as an alternative to traditional fault tolerance approaches because it provides fault tolerance at a lower cost than traditional approaches.
AutomaDeD
This tool that automatically diagnoses performance and correctness faults in MPI applications. It identifies abnormal MPI tasks and code regions and finds the least-progressed task.
Investigation of disaggregated memory systems wins poster award
Splitting memory resources in high performance computing between local nodes and a larger shared remote pool can help better support diverse applications.
LLNL’s Diachin takes helm of DOE’s Exascale Computing Project
Lori Diachin will take over as director of the DOE’s Exascale Computing Project on June 1, guiding the successful, multi-institutional high performance computing effort through its final stages.
Podcast: Siting the El Capitan exascale supercomputer
Livermore CTO Bronis de Supinski joins the Let's Talk Exascale podcast to discuss the details of LLNL's upcoming exascale supercomputer.