Fast Global File Status (FGFS) is an open-source package that provides scalable mechanisms and programming interfaces for retrieving global information about a file.
Topic: HPC Systems and Software
MPI_T is an interface for tools introduced in the 3.0 version of MPI. The interface provides mechanisms for tools to access and set performance and control variables that are exposed by an MPI implementation.
Cram lets you easily run many small MPI jobs within a single, large MPI job by splitting MPI_COMM_WORLD up into many small communicators to run each job in the cram file independently.
libMSR provides a convenient interface to access Model Specific Registers and to allow tools to utilize their full functionality.
Veritas provides a method for validating proxy applications to ensure that they capture the intended characteristics of their parents.
Application-level resilience is emerging as an alternative to traditional fault tolerance approaches because it provides fault tolerance at a lower cost.
This tool automatically diagnoses performance and correctness faults in MPI applications. It identifies abnormal MPI tasks and code regions and finds the least-progressed task.
Working on world-class supercomputers at a U.S. national laboratory was not what Edgar Leon, a native of Mexico, envisioned when he began preparing for university.
These techniques emulate the behavior of anticipated future architectures on current machines to improve performance modeling and evaluation.
With SCR, jobs run more efficiently, recover more work upon failure, and reduce load on critical shared resources.
Greg Lee helps develop tools designed to boost performance and productivity of Livermore scientists.
Olga Pearce studies how to detect and correct load imbalance in high performance computing applications.
Working extensively with open-source software such as Lustre and engaging with the broader open-source community is what computer scientist Chris Morrone enjoys most about his job, and it is one of the features that attracted him to LLNL in the first place.
Kathryn Mohror develops tools that give researchers the information they need to tune their programs and maximize results. After all, she says, “It’s all about getting the answers more quickly.”