While installing a computer cluster requires some foundational skills and experience, getting it up and running is only part of the job! Understanding how your system works, how to protect it from threats like malware and viruses, and how to maintain it for optimal performance requires next-level skills. At the HPC Cluster Engineer Academy, you’ll explore all of this through team-based projects. Teams investigate problems and possible solutions, then present their findings at a special symposium for summer scholars. Check out the projects from previous years below.
HPC Cluster Engineer Academy 2023 Presentations
- Account and Authorization Management System
- CerebrasGPT Web Application
- Flux RestAPI
HPC Cluster Engineer Academy 2022 Presentations
- Installation of Flux
- Installing & Configuring Lustre on KVMs
- RabbitMQ and Kafka
HPC Cluster Engineer Academy 2021 Presentations
- Building a High Availability NFS Server
- Slurm Rest API
- Survey of HPC Containers Tools
- Video of the above presentations
HPC Cluster Engineer Academy 2020 Presentations
- Centralized Node Attribute DB for HPC
- Architecture-Level Application Optimization
- KVM on LC Clusters
HPC Cluster Engineer Academy 2019 Presentations
- MongoNAS
- Object Storage Investigation
- Merlin Workflow Tools
HPC Cluster Engineer Academy 2018 Presentations
- Computer Monitoring with Prometheus & Grafana
- Expanding Livmomi
- HPSS Deployment Automation
HPC Cluster Engineer Academy 2017 Presentations
- Ceph: A Distributed File System
- Elastic Stack Installation & Configuration
- Kubernetes Implementation into HPC
HPC Cluster Engineer Academy 2016 Presentations
- The Power of Heterogeneous Computing on GPUs
- Monitoring Clusters Using Sqrl
- Proof-of-Concept for Heterogeneous GPU Computing
- Tory Computer Inventory Script