Distributed Computing

In this assignment, we will install and configure Slurm Manager. The master is obviously the master node, and the other two VMs are the compute nodes. There is a good tutorial available https://www.invik.xyz/work/Slurm-on-Ubuntu-Trusty/. I won’t repeat everything it says. I’ll make general comments below.


My install did not have the configurator they mention in the tutorial, so I used the online one: https://slurm.schedmd.com/configurator.html. I kept most of the defaults and only tweaked for master, server1, etc. Honestly, it’s probably a good idea to start with just everything on master, and then incorporate the other compute nodes.

I had to install this as well. sudo apt-get install mailutils

sinfo and other commands are good for showing success. You can also try a script that shows it is running on different hostnames, such as:

#SBATCH -p debug
#SBATCH -n 1
#SBATCH -t 12:00:00
#SBATCH -J some_job_name

hostname > /shared/`hostname`.out

There are also a lot of other resources online such as https://thehatteronline.com/2014/11/18/turn-your-workstation-into-a-mini-grid-with-slurm/