Topic: Storage, File Systems, and I/O

Supported by the Advanced Simulation and Computing program, the open-source Axom project focuses on developing software infrastructure components that can be shared by HPC applications running on diverse computing platforms.

Project

CTO Bronis de Supinski discusses the integrated storage strategy of the future El Capitan exascale supercomputing system, which will have in excess of 2 exaflops of raw computing power spread across nodes.

News Item

A near node local storage innovation called Rabbit factored heavily into LLNL’s decision to select Cray’s proposal for its CORAL-2 machine, the lab’s first exascale-class supercomputer, El Capitan.

News Item

LLNL is home to advanced tech at the high end of compute, networking, and storage. The article includes a video featuring Robin Goldstone, HPC strategist in the Lab's Advanced Technologies Office.

News Item

LLNL is now home to the world’s largest Spectra TFinity system, following a complete replacement of the tape library hardware that supports Livermore’s data archives.

News Item

A software product from the ECP called UnifyFS can provide I/O performance portability for applications, enabling them to use distributed in-system storage and the parallel file system.

News Item

In the second of three news features, LLNL-developed OpenZFS software will contribute to the efficiency and effectiveness of the HPC leadership-class systems of tomorrow.

News Item

This open-source file system framework supports hierarchical HPC storage systems by utilizing node-local burst buffers.

Project

Livermore’s archive leverages a hierarchical storage management application that runs on a cluster architecture that is user-friendly, extremely scalable, and lightning fast.

Project

Highlights include complex simulation codes, uncertainty quantification, discrete event simulation, and the Unify file system.

News Item

“If applications don’t read and write files in an efficient manner,” system software developer Elsa Gonsiorowski warns, “entire systems can crash.”

People Highlight

Livermore Computing staff is enhancing the high-speed InfiniBand data network used in many of its high performance computing and file systems.

Project

Computer scientists are incorporating ZFS into their high-performance parallel file systems for better performance and scalability.

Project

Fast Global File Status (FGFS) is an open-source package that provides scalable mechanisms and programming interfaces to retrieve global information of a file.

Project

Spindle improves the library-loading performance of dynamically linked HPC applications. It plugs into the system’s dynamic linker and intercepts its file operations so that only one process (or other small amount) will perform the file operations necessary and share the results with other processes in the job.

Project

With SCR, jobs run more efficiently, recover more work upon failure, and reduce load on critical shared resources.

Project

Working extensively with open-source software such as Lustre and engaging with the broader open-source community is what computer scientist Chris Morrone enjoys most about his job, and it is one of the features that attracted him to LLNL in the first place.

People Highlight