Topic: Resource and Workflow Management

Livermore researchers have developed a toolset for solving data center bottlenecks. Flux offers a framework that enables new resource types, schedulers, and framework services to be deployed as data centers continue to evolve.

Project

This video describes Flux, an open-source software framework that manages and schedules computing workflows so that applications make the most of available resources and run faster and more efficiently.

News Item

LLNL, IBM, and Red Hat are combining forces to develop best practices for interfacing HPC schedulers and cloud orchestrators, an effort designed to prepare for emerging supercomputers that take advantage of cloud technologies.

News Item

CTO Bronis de Supinski discusses the integrated storage strategy of the future El Capitan exascale supercomputing system, which will have in excess of 2 exaflops of raw computing power spread across nodes.

News Item

A near-node local storage innovation called Rabbit factored heavily into LLNL’s decision to select Cray’s proposal for its CORAL-2 machine, the lab’s first exascale-class supercomputer, El Capitan.

News Item

Computing’s fourth annual Developer Day was held as a virtual event on July 30 with 8 speakers and 90 participants.

News Item

LLNL computer scientist Stephen Herbein discusses the open-source Flux Framework HPC software in this video episode of Next Platform TV. His segment begins at 27:34.

News Item

The Maestro Workflow Conductor is a lightweight, open-source Python tool that can launch multi-step software simulation workflows in a clear, concise, consistent, and repeatable manner.

Project

New year, new hackathon! The January 30–31 event was Computing’s 23rd hackathon and the 1st scheduled in the winter season.

News Item

The first of three news features describes how LLNL-developed Flux software will contribute to the efficiency and effectiveness of the HPC leadership-class systems of tomorrow.

News Item

Highlights include CASC director Jeff Hittinger's vision for the center as well as recent work with PruneJuice, DataRaceBench, Caliper, and SUNDIALS.

News Item

Highlights include complex simulation codes, uncertainty quantification, discrete event simulation, and the Unify file system.

News Item

Cram lets you easily run many small MPI jobs within a single, large MPI job by splitting MPI_COMM_WORLD up into many small communicators to run each job in the cram file independently.

Project
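
The communicator-splitting idea behind Cram can be illustrated with a minimal sketch. This is not Cram's implementation: the function name `assign_colors` and its parameters are hypothetical, and the code only computes, in plain Python without MPI, the (color, key) pair each rank of the large job would plausibly pass to a call such as `MPI_Comm_split` so that ranks sharing a color end up in the same small communicator.

```python
def assign_colors(job_sizes, world_size):
    """Return one (color, key) pair per rank of the large job.

    Hypothetical helper for illustration only (not part of Cram).
    color: index of the small job the rank belongs to.
    key:   the rank's position inside that small job.
    Ranks left over after all jobs are placed get (None, None),
    playing the role MPI_UNDEFINED plays in a real split.
    """
    if any(size <= 0 for size in job_sizes):
        raise ValueError("every job needs at least one rank")
    if sum(job_sizes) > world_size:
        raise ValueError("jobs need more ranks than the large job has")

    assignment = []
    for job_id, size in enumerate(job_sizes):
        # Consecutive ranks are grouped into the same small job.
        for local_rank in range(size):
            assignment.append((job_id, local_rank))

    # Any remaining ranks take part in no small job.
    unused = world_size - len(assignment)
    assignment.extend([(None, None)] * unused)
    return assignment


if __name__ == "__main__":
    # Two small jobs (2 ranks and 3 ranks) packed into a 6-rank job.
    for rank, (color, key) in enumerate(assign_colors([2, 3], 6)):
        print(rank, color, key)
```

In an actual MPI program, each rank would look up only its own entry and hand the color and key to the split call; grouping consecutive ranks is just one possible placement policy assumed here.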

A comprehensive understanding of the performance behavior of large-scale simulations requires the ability to compile, analyze, and compare measurements and contexts from many independent sources. Caliper, a general-purpose application introspection system, makes that task easier by connecting various independent context annotations, measurement services, and data processing services.

Project