Data Carpentry workshop: Compational Resources at Berkeley and Beyond

August 7, 2018

Chris Paciorek

Outline

Berkeley Research IT helps researchers through its Berkeley Research Computing (BRC) and Research Data Management (RDM) programs.

This (brief) session will cover the following topics:

System capabilities and hardware

Berkeley Research Computing runs the campus cluster, Savio.

Getting access to the system - FCA and condo

Faculty/principal investigators can allow researchers working with them to get user accounts with access to the FCA or condo resources available to the faculty member.

Savio computing nodes

Savio provides access to the following types of computational resources:

Let's take a look at the hardware specifications of the computing nodes on the cluster (see the Hardware Configuration section of this document).

The nodes are divided into several pools, called partitions. These partitions have different restrictions and costs associated with them (see the Scheduler Configuration section of this document). Any job you submit must be submitted to a partition to which you have access.

Submitting jobs: accounts and partitions

All computations are done by submitting jobs to the scheduling software that manages jobs on the cluster, called SLURM.

Here's an example job script for a batch job. You'll need to modify the various "--" flags for your own work.

#!/bin/bash
# Job name:
#SBATCH --job-name=test
#
# Account:
#SBATCH --account=fc_paciorek
#
# Partition:
#SBATCH --partition=savio2
#
# Number of tasks (e.g., MPI tasks, here 2 nodes worth)
#SBATCH --ntasks=48
#
# Wall clock limit (30 minutes here):
#SBATCH --time=00:30:00
#
## Command(s) to run:
module load python/3.6
python calc.py >& calc.out

Genomics on Savio: things to keep in mind

Disk space options on Savio and on campus broadly

Here are some options for moderate-large disk storage options:

More details on Savio storage are here here in the Storage and Backup section.

Data transfer for large data

Some options include:

NSF's XSEDE: To infinity and beyond

XSEDE provides free access to NSF-funded clusters around the US, e.g.,

Accessing XSEDE

Several modes of (free) access:

BRC can help with all of these.

How to get additional help

Don't hesitate to contact us; we're friendly, even with basic questions.

Upcoming events