How to Run dsgrid on Kestrel¶

This guide explains how to run dsgrid on NLR’s Kestrel HPC system.

Steps¶

SSH to a login node and start a screen session (or similar tool like tmux):

screen -S dsgrid

This allows you to maintain your session even if you disconnect.

Follow the installation instructions at Installation.

Create a dsgrid runtime config file pointing to the shared registry:

dsgrid config create sqlite:////projects/dsgrid/standard-scenarios.db

This configures dsgrid to use the NLR shared registry database.

Start a Spark cluster with your desired number of compute nodes by following the instructions at Start Spark Cluster on Kestrel.

Run all CPU-intensive dsgrid commands from the first node in your HPC allocation using spark-submit:

spark-submit --master=spark://$(hostname):7077 $(which dsgrid-cli.py) [command] [options] [args]

Examples:

spark-submit --master=spark://$(hostname):7077 $(which dsgrid-cli.py) \
    registry datasets register dataset.json5 \
    -l "Register my dataset"

Run a query:

spark-submit --master=spark://$(hostname):7077 $(which dsgrid-cli.py) \
    query project run query.json5

Because you started a screen session at the beginning, if you disconnect from your SSH session for any reason, you can pick your work back up:

screen -r dsgrid

List available registries:

dsgrid registry list

Check project details:

dsgrid registry projects show <project-id>

Validate a config file:

dsgrid config validate dataset.json5