Skip to main content

One post tagged with "Cloud Computing"

Cloud Computing

View All Tags

Slurm: Your Cluster's Traffic Controller

· 23 min read
info

If you're only working on the BASIC Lab server, Slurm might not be necessary yet. However, if you plan to use NCHC resources, then learning Slurm is a must. All NCHC clusters are managed through Slurm.

Introduction

So you've finally gotten access to NCHC's fancy compute cluster. No more waiting three days using slow and outdated GPUs to finish training that model. You're ready for the big leagues!

You SSH in, navigate to your directory, and type the command you've been using on your local machine for months: python my_awesome_script.py

And then... someone taps you on the shoulder (virtually or literally) and says, "Oh no, you can't just run things directly here. You need to submit it through Slurm."