Configuration

Hardware

| Node Type | Standard | Large Memory | GPU | GPU MIG40 | SPGPU | RTX6000 Blackwell | VIZ, VIZ-LONG |
| --- | --- | --- | --- | --- | --- | --- | --- |
| Partition Name | standard | largemem | gpu | gpu_mig40 | spgpu | gpu-rtx6000 | viz, viz-long |
| Number of Nodes | 455 | 8 | 25 | 2 | 28 | 2 public, 10 owned | 4 |
| Processors | 2x 3.0 GHz Intel Xeon Gold 6154 | 2x 3.0 GHz Intel Xeon Gold 6154 | 2x 2.4 GHz Intel Xeon Gold 6148 | 2x 2.60 GHz Intel Xeon Platinum 8358P | 2x 2.9 GHz Intel Xeon Gold 6226R | 2x 3.30 GHz AMD EPYC 9575F | 2x 2.4 GHz Intel Xeon Gold 6148 |
| Cores per Node | 36 | 36 | 40 | 64 | 32 | 128 | 40 |
| RAM | 187 GB (180 GB requestable) | 1.5 TB (1,503 GB requestable) | 187 GB (180 GB requestable) | 1007 GB (1000 GB requestable) | 376 GB (372 GB requestable) | 1.5 TB | 187 GB (180 GB requestable) |
| Storage | 480 GB SSD + 4 TB HDD | 4 TB HDD | 4 TB HDD | 890 GB SSD + 3.5 TB SSD | 480 GB SSD + 14 TB NVMe SSD | 15 TB NVMe | 4 TB HDD |
| GPU | N/A | N/A | 20 nodes: 2x NVIDIA Tesla V100 16 GB; 4 nodes: 3x V100 16 GB | 2 nodes: 4x NVIDIA A100 80 GB, each divided into 2 40 GB MIG instances (16 total) | 28 nodes: 8x NVIDIA A40 48 GB | 16 public RTX6000 Blackwell Server Edition | 4 nodes: 1x NVIDIA Tesla P40 24 GB |
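
When choosing a partition, it can help to compare requestable memory per core and to check which node types can hold a job on a single node. The following is a minimal sketch using the per-node figures from the hardware table above. The names `NODES`, `mem_per_core_gb`, and `partitions_that_fit` are hypothetical and not part of any Great Lakes tool, and the sketch ignores GPU requirements and any site-specific limits.

```python
# Per-node figures copied from the hardware table above:
# partition -> (cores per node, requestable RAM in GB).
# gpu-rtx6000 is omitted because the table lists no requestable figure for it.
NODES = {
    "standard": (36, 180),
    "largemem": (36, 1503),
    "gpu": (40, 180),
    "gpu_mig40": (64, 1000),
    "spgpu": (32, 372),
    "viz": (40, 180),
}


def mem_per_core_gb(partition: str) -> float:
    """Requestable RAM of one node divided evenly across its cores."""
    cores, ram_gb = NODES[partition]
    return ram_gb / cores


def partitions_that_fit(cores: int, mem_gb: float) -> list[str]:
    """Partitions whose single node has enough cores and requestable RAM."""
    return [
        name
        for name, (node_cores, node_ram_gb) in NODES.items()
        if cores <= node_cores and mem_gb <= node_ram_gb
    ]


if __name__ == "__main__":
    for name in NODES:
        print(f"{name:10s} {mem_per_core_gb(name):6.1f} GB per core")
    # A hypothetical 16-core job that needs 300 GB on one node.
    print(partitions_that_fit(cores=16, mem_gb=300))
```

A job that needs more memory per core than its partition offers will leave cores idle on the node, which is one reason to consider the large-memory nodes for such work.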

GPUs

Great Lakes has 52 NVIDIA Tesla V100 GPUs connected to 24 nodes and 4 NVIDIA A100 80 GB GPUs connected to 1 node. A further 160 NVIDIA A40 GPUs connected to 28 nodes are available for single-precision work.

| GPU Model | NVIDIA Tesla V100 | NVIDIA A40 | NVIDIA A100 | NVIDIA RTX6000 Pro Blackwell |
| --- | --- | --- | --- | --- |
| GPU Architecture | Volta | Ampere | Ampere | Blackwell |
| Peak double precision floating point perf. | 7.066 TFLOPS | N/A | 9.7 TFLOPS (non-Tensor); 19.5 TFLOPS (Tensor) | N/A |
| Peak single precision floating point perf. | 14.13 TFLOPS | 37.4 TFLOPS (non-Tensor); 74.8 TFLOPS (Tensor) | 19.5 TFLOPS (non-Tensor); 156 TFLOPS (Tensor) | 120.0 TFLOPS; 4000 AI TOPS |
| Memory bandwidth (ECC off) | 897.0 GB/s | 696 GB/s | 1935 GB/s | 1597 GB/s |
| Memory size and type | 16 GB HBM2 | 48 GB GDDR6 | 80 GB HBM2e | 96 GB GDDR7 |
| CUDA cores | 5120 | 10752 | 6912 | 24064 |
| RT cores | N/A | 84 | N/A | 188 |
| Tensor cores | 640 | 336 | 432 | 752 |
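
One way to read the peak-rate and bandwidth rows together is the roofline model: a kernel that performs fewer floating point operations per byte of memory traffic than the GPU's "ridge point" is limited by bandwidth, not by compute. The sketch below applies that model to the single-precision, non-Tensor figures from the GPU table above. The names `GPUS`, `ridge_point`, and `attainable_tflops` are hypothetical, and the results are theoretical upper bounds, not measured performance.

```python
# Figures from the GPU table above (single precision, non-Tensor):
# model -> (peak TFLOPS, memory bandwidth in GB/s).
GPUS = {
    "V100": (14.13, 897.0),
    "A40": (37.4, 696.0),
    "A100": (19.5, 1935.0),
    "RTX6000 Pro Blackwell": (120.0, 1597.0),
}


def ridge_point(peak_tflops: float, bandwidth_gb_s: float) -> float:
    """FLOP per byte above which a kernel is compute-bound in the roofline model."""
    return (peak_tflops * 1e12) / (bandwidth_gb_s * 1e9)


def attainable_tflops(model: str, flop_per_byte: float) -> float:
    """Roofline bound: min(peak compute, arithmetic intensity * bandwidth)."""
    peak_tflops, bandwidth_gb_s = GPUS[model]
    bandwidth_bound = flop_per_byte * bandwidth_gb_s * 1e9 / 1e12
    return min(peak_tflops, bandwidth_bound)


if __name__ == "__main__":
    for model, (peak, bandwidth) in GPUS.items():
        print(f"{model:22s} ridge point {ridge_point(peak, bandwidth):6.1f} FLOP/byte")
    # A hypothetical kernel doing 1 FLOP per byte moved, on an A100.
    print(attainable_tflops("A100", 1.0))
```

Under this model, a kernel with low arithmetic intensity gains more from the A100's memory bandwidth than from the higher single-precision peak of the A40.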

Networking

The compute nodes are all interconnected with InfiniBand HDR100 networking, capable of 100 Gb/s throughput. In addition to the InfiniBand networking, there is 25 Gb/s Ethernet for the login and transfer nodes and a gigabit Ethernet network that connects the remaining nodes. The gigabit network is used for node management and NFS file system access.
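
To put the link speeds above in perspective, the following sketch estimates the ideal time to move a dataset over each network. It assumes decimal units (1 GB = 10^9 bytes), 8 bits per byte, and no protocol or file system overhead, so real transfers will be slower. The names `LINK_GBIT_PER_S` and `ideal_transfer_seconds` are hypothetical.

```python
# Nominal link speeds from the paragraph above, in gigabits per second.
LINK_GBIT_PER_S = {
    "InfiniBand HDR100": 100,
    "25 Gb/s Ethernet (login and transfer nodes)": 25,
    "gigabit Ethernet": 1,
}


def ideal_transfer_seconds(size_gb: float, link_gbit_per_s: float) -> float:
    """Lower bound on transfer time: gigabytes * 8 bits / link rate in Gb/s."""
    return size_gb * 8 / link_gbit_per_s


if __name__ == "__main__":
    dataset_gb = 100  # hypothetical dataset size
    for name, rate in LINK_GBIT_PER_S.items():
        seconds = ideal_transfer_seconds(dataset_gb, rate)
        print(f"{name}: at least {seconds:.0f} s for {dataset_gb} GB")
```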

Storage

The high-speed scratch file system provides 2 petabytes of storage at approximately 80 GB/s performance.

Scheduling & Billing

Job scheduling and billing on Great Lakes are managed entirely through the Slurm Workload Manager.
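
Since jobs go through Slurm, a request for the resources listed in the hardware table is normally written as `#SBATCH` directives in a batch script. The sketch below generates such a script as text. The function name `render_batch_script`, the account name `example_account`, and the command are hypothetical. The directives themselves are standard `sbatch` options, but whether Great Lakes expects `--gres` or another form for GPU requests is an assumption to check against the site's own user guide.

```python
from typing import Optional


def render_batch_script(
    job_name: str,
    account: str,
    partition: str,
    command: str,
    nodes: int = 1,
    tasks_per_node: int = 1,
    cpus_per_task: int = 1,
    mem_gb: Optional[int] = None,
    gpus: int = 0,
    time_limit: str = "01:00:00",
) -> str:
    """Build the text of a Slurm batch script from standard sbatch options."""
    lines = [
        "#!/bin/bash",
        f"#SBATCH --job-name={job_name}",
        f"#SBATCH --account={account}",
        f"#SBATCH --partition={partition}",
        f"#SBATCH --nodes={nodes}",
        f"#SBATCH --ntasks-per-node={tasks_per_node}",
        f"#SBATCH --cpus-per-task={cpus_per_task}",
        f"#SBATCH --time={time_limit}",
    ]
    if mem_gb is not None:
        # --mem is the memory requested per node.
        lines.append(f"#SBATCH --mem={mem_gb}G")
    if gpus:
        # Generic resource request for GPUs on each node.
        lines.append(f"#SBATCH --gres=gpu:{gpus}")
    lines += ["", command, ""]
    return "\n".join(lines)


if __name__ == "__main__":
    # Hypothetical single-GPU job on the gpu partition.
    print(render_batch_script(
        job_name="demo",
        account="example_account",
        partition="gpu",
        command="python train.py",
        cpus_per_task=4,
        mem_gb=90,
        gpus=1,
    ))
```

The printed text could be saved to a file and submitted with `sbatch`. Billing depends on the resources requested, so it is worth keeping requests close to what the job actually uses.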

Operating Software

The Great Lakes cluster runs Red Hat Enterprise Linux (RHEL) 8. We update the operating system on Great Lakes as Red Hat releases new versions and as our library of third-party applications adds support for them. Because we must support several types of drivers (AFS and Lustre file system drivers, InfiniBand network drivers, and NVIDIA GPU drivers) and dozens of third-party applications, we are cautious in upgrading and can lag Red Hat releases by months.

Compilers, Parallel, & Scientific Libraries

Great Lakes supports the GNU Compiler Collection, the Intel Compilers, and the PGI Compilers for C and Fortran. The cluster's parallel library is Open MPI. Great Lakes provides the Intel Math Kernel Library (MKL) set of high-performance mathematical libraries. Other common scientific libraries are compiled from source, including HDF5, NetCDF, FFTW3, and Boost.

Application Software

For detailed information, see the software page. (link TBD)