Browse this site's news, projects, and people highlights via any of the topics in the dropdown list or below each content description.
Application-Level Resilience
Application-level resilience is emerging as an alternative to traditional fault tolerance approaches because it provides fault tolerance at a lower cost than traditional approaches.
AutomaDeD
This tool that automatically diagnoses performance and correctness faults in MPI applications. It identifies abnormal MPI tasks and code regions and finds the least-progressed task.
GREMLINs
These techniques emulate the behavior of anticipated future architectures on current machines to improve performance modeling and evaluation.
LLNL, DOD, NNSA dedicate Rapid Response Laboratory and supercomputing system to accelerate biodefense
The collaboration has enabled expanding systems of the same architecture as LLNL’s upcoming exascale supercomputer, El Capitan, featuring AMD’s cutting-edge MI300A processors.
ISCP projects make machine learning advantages tangible
To keep employees abreast of the latest tools, two data science–focused projects are under way as part of Lawrence Livermore’s Institutional Scientific Capability Portfolio.
CASC Newsletter | Vol 14 | June 2024
This issue highlights some of CASC’s contributions to the DOE's Exascale Computing Project.
