Topic: Storage, File Systems, and I/O

This interview with HPC-AI Vanguard Kathryn Mohror covers her thoughts on teamwork, her projects, the field, and more.

News Item

The latest issue of R&D World's magazine showcases LLNL's 2024 winning technologies, including UnifyFS and UMap software projects.

News Item

The Association for Computing Machinery's (ACM) Special Interest Group on High Performance Computing (SIGHPC) has awarded Kathryn Mohror with its prestigious Emerging Woman Leader in Technical Computing (EWLTC) Award.

News Item

With SCR, jobs run more efficiently, recover more work upon failure, and reduce load on critical shared resources.

Project

Bugs, broken codes, or system failures require added time for troubleshooting and increase the risk of data loss. LLNL has addressed failure recovery by developing the Scalable Checkpoint/Restart (SCR) framework.

News Item

This open-source file system framework accelerates hierarchical HPC I/O operations with effective, efficient use of node-local storage.

Project

Discover how the software architecture and storage systems that will drive El Capitan’s performance will help LLNL and the NNSA Tri-Labs push the boundaries of computational science.

News Item

LLNL's Ian Lee joins a Dots and Bridges panel to discuss HPC as a critical resource for data assimilation and numerical weather prediction research.

News Item

A research team from Oak Ridge and Lawrence Livermore national labs won the first IPDPS Best Open-Source Contribution Award for the paper “UnifyFS: A User-level Shared File System for Unified Access to Distributed Local Storage.”

News Item

LC’s adaptation of OpenZFS software provides high performance parallel file systems with better performance and scalability.

Project

A multidecade, multi-laboratory collaboration evolves scalable long-term data storage and retrieval solutions to survive the march of time.

News Item

LLNL is home to the world’s largest Spectra TFinityTM system, which offers the speed, agility, and capacity required to take LLNL into the exascale era.

Project

As Computing’s sixth Fernbach Fellow, postdoctoral researcher Chen Wang will work on a new I/O programming paradigm and improve HPC storage consistency models under the mentorship of Kathryn Mohror.

People Highlight

Livermore’s archive leverages a hierarchical storage management application that runs on a cluster architecture that is user-friendly, extremely scalable, and lightning fast.

Project

LLNL participates in the International Parallel and Distributed Processing Symposium (IPDPS) on May 30 through June 3.

News Item

From molecular screening, a software platform, and an online data to the computing systems that power these projects.

Project

El Capitan will have a peak performance of more than 2 exaflops—roughly 16 times faster on average than the Sierra system—and is projected to be several times more energy efficient than Sierra.

Project

Supported by the Advanced Simulation and Computing program, Axom focuses on developing software infrastructure components that can be shared by HPC apps running on diverse platforms.

Project

Highlights include complex simulation codes, uncertainty quantification, discrete event simulation, and the Unify file system.

News Item

“If applications don’t read and write files in an efficient manner,” system software developer Elsa Gonsiorowski warns, “entire systems can crash.”

People Highlight

Livermore Computing staff is enhancing the high-speed InfiniBand data network used in many of its high performance computing and file systems.

Project

Fast Global File Status (FGFS) is an open-source package that provides scalable mechanisms and programming interfaces to retrieve global information of a file.

Project

Spindle improves the library-loading performance of dynamically linked HPC applications by plugging into the system’s dynamic linker and intercepting its file operations.

Project