Storage Guide

Understanding Storage on Compute Nodes

Storage types based on node accessibility

When working on any compute node within the system, there are two types of storage accessible to a node: local storage and shared storage. The former is only accessible by a particular node, while the latter is accessible by all nodes.

On every node, the path /mnt/local/scratch/$USER points to local storage. That means if you have two nodes N1 and N2, anything stored at /mnt/local/scratch/$USER in N1 won’t be accessible by N2 and vice-versa. In contrast, the paths /work/$USER and /home/$USER point to a location in shared storage and are thus accessible by both N1 and N2.

The described storage architecture is depicted in the diagram below:

Logical Storage Architecture

Filesystem locations users should understand

Scratch

  • Location on compute nodes: /mnt/local/scratch/$USER.

  • Shortcut Variable: $SCRATCH

  • Local to each compute node: This storage is temporary, and local to a compute node. It is available to a user only when a node is assigned for running jobs

  • High performance: Offers significantly faster read/write speeds compared to home and work storage.

  • Limited capacity: the capacity varies as per the node type and can be anywhere between 480 GB to 30 TB. Please refer to the table on the Zurada hardware specs for details

  • Data retention policy: ALL DATA IS REMOVED after a job finishes.

Work

  • Location in filesystem: /work/$USER.

  • Shortcut Variable: $WORK

  • Shared across all nodes: Accessible from any compute node in the system.

  • Large capacity: Designed to store big files, datasets and, in general, input/output files used/produced by a job.

  • Slower access: Due to its shared nature, read/write operations are generally slower than scratch storage.

  • Parallel I/O: Multiple parts of large files can be accessed simultaneously (e.g. MPI-IO).

  • Data retention policy: Files not accessed within 30 days are candidate for deletion.

Home

  • Location in filesystem: /home/$USER.

  • Shortcut Variable: $HOME or ~

  • Shared across all nodes: Accessible from any compute node in the system.

  • Limited capacity: Hard quota limit of 25GB per user. If you try to write more than 25G, an error will be displayed and any subsequent write operations will be denied.

  • Slower access: Due to its shared nature, read/write operations are generally slower than local scratch storage.

  • Data retention policy: Data is kept and backed up for 7 days.

Note

Shortcut variables in bash are set upon entry to a Zurada node, so you may utilize them while inside the system (scripts, navigation, etc). If you are using a command (ex. scp, or rsync) from your laptop/desktop utilization of the shortcut variables will not work.