Hardware : Différence entre versions
(→Dell T640 GPU nodes (2018-2019-2020)) |
|||
Ligne 163 : | Ligne 163 : | ||
= GPU nodes = | = GPU nodes = | ||
+ | |||
+ | == HPE DL385 GPU nodes (2023) == | ||
+ | |||
+ | '''HPE DL385''' dual-AMD EPYC 7662 @ 2.0GHz (64 cores) : 1 node ( '''nefgpu56''' ) | ||
+ | * Nvidia A100 PCIe GPUs cards | ||
+ | ** 6912 CUDA cores per card | ||
+ | ** 432 tensor cores per card | ||
+ | ** 40GB of RAM capacity per card | ||
+ | ** Tensor performance peak : 155.92 TFlops per card | ||
+ | ** Simple precision performance peak: 77.97 Tflops per card | ||
+ | ** Double precision performance peak: 19.49 Tflops per card | ||
+ | ** 1555 GB/s GPU memory bandwidth with error correction (ECC) | ||
+ | * 2x gigabit network ports (one connected) | ||
+ | * infiniband EDR card (connected to FDR switch) | ||
+ | * hyperthreading active | ||
+ | {| class="wikitable" | ||
+ | |+Node details | ||
+ | |- | ||
+ | | '''Node name''' | ||
+ | | '''Funding team''' | ||
+ | | '''GPU cards''' | ||
+ | | '''Node CPU''' | ||
+ | | '''Node RAM''' | ||
+ | | '''Node storage''' | ||
+ | |- | ||
+ | | nefgpu56 | ||
+ | | STARS | ||
+ | | 3x A100 | ||
+ | | 2x AMD EPYC 7662 | ||
+ | | 1024 GB | ||
+ | | system & /tmp : RAID-1 2x 480SSD<br>/local/mixed : RAID-5 5x 960 SSD|} | ||
+ | |||
== Dell R7525 GPU nodes (2020) == | == Dell R7525 GPU nodes (2020) == | ||
− | '''Dell R7525''' dual-EPYC 7282 @ 2.8GHz (16 cores) : 2 nodes ( '''nefgpu52 to nefgpu53''' )<br> | + | '''Dell R7525''' dual-AMD EPYC 7282 @ 2.8GHz (16 cores) : 2 nodes ( '''nefgpu52 to nefgpu53''' )<br> |
− | '''Dell R7525''' dual-EPYC 7413 @ 2.65GHz (24 cores) : 2 nodes ( '''nefgpu54 to nefgpu55''' ) | + | '''Dell R7525''' dual-AMD EPYC 7413 @ 2.65GHz (24 cores) : 2 nodes ( '''nefgpu54 to nefgpu55''' ) |
* Nvidia A40 PCIe GPUs cards | * Nvidia A40 PCIe GPUs cards | ||
** 10752 CUDA cores per card | ** 10752 CUDA cores per card |
Version du 17 mars 2023 à 18:19
CPU & GPU summary
The current platform includes:
- 16 dual-Xeon SP 10 cores, 9.6GB/core RAM
- 1 quad-Xeon SP 20 cores, 12.8GB/core RAM
- 8 dual-Xeon 8 cores, 16GB/core RAM
- 16 dual-Xeon 10 cores, 9.6GB/core RAM
- 6 quad-Opteron 16 cores, 4GB/core RAM
- 6 quad-Opteron 12 cores, ~5.3GB/core RAM
- 1 dual-EPYC 32 cores, 16GB/core RAM
- 1 dual-EPYC 32 cores, 6GB/core RAM
- 2 dual-EPYC 16 cores, 8GB/core RAM
- 2 dual-EPYC 24 cores, 10GB/core RAM
- 164 Nvidia GPU
The number of CPU cores available is :
- 144 Xeon SP Gold 2.6GHz cores : ~6.7 TFlops AVX-512 (9.2 TFlops peak)
- 320 Xeon SP Silver 2.2GHz cores : ~8.8 TFlops AVX2 (11.2 TFlops peak) / ~6.4 TFlops AVX-512 (9.7 TFlops peak)
- 80 Xeon SP Gold 2.4GHz cores : ~2.4 TFlops AVX2 (3.1 Tflops peak) / ~3.8 Tflops AVX-512 (4.9 Tflops peak)
- 128 Xeon E5 v2 2.6GHz cores : ~2.4 Tflops (2.7 Tflops peak)
- 320 Xeon E5 v2 2.8GHz cores : ~5.9 Tflops (7.2 Tflops peak)
- 384 Opteron 2.3GHz cores : ~2.7 Tflops (3.5 Tflops peak)
- 288 Opteron 2.2GHz cores : ~2 Tflops (2.5 Tflops peak)
- 64 EPYC 2.9GHz cores : ~1.5 Tflops AVX2
- 64 EPYC 2.5GHz cores : ~1.25 Tflops AVX2
- 32 EPYC 2.8GHz cores : ~1.25 Tflops AVX2
- 48 EPYC 2.65GHz cores : ~1.25 Tflops AVX2
The total cumulated CPU computing power is close to ~37.9 TFlops in double precision (44,5 Tflops peak)
The number of GPU cores available is :
- 625088 Streaming Processor Cores
CPU clusters
Dell R7525 nodes (2020)
Dell R7525 dual-EPYC 7502 @ 2.5GHz (64 cores) : 1 node ( nef059 )
- RAM capacity : 384 GB RAM
- storage : system 2x900 GB SATA SSD RAID-1 + local scratch data 7x7.5TB SAS HDD 44 TB RAID-5 + controller H745
- 2x gigabit network ports (one connected)
- infiniband EDR card (connected to FDR switch)
- hyperthreading active
Dell R7525 dual-EPYC 7542 @ 2.9GHz (64 cores) : 1 node ( nef058 )
- RAM capacity : 1024 GB RAM
- storage : system 2x223 GB SATA SSD RAID-1 + local scratch data 5x445GB SAS SSD 1.8 TB RAID-5 + controller H745
- 2x gigabit network ports (one connected)
- infiniband EDR card (connected to FDR switch)
- hyperthreading active
Dell C6420 cluster (2019)
Dell C6420 dual-Xeon Cascade Lake SP Gold 6240 @ 2.60GHz (36 cores) : 4 nodes ( nef054 to nef057 )
- RAM capacity : 384 GB RAM
- storage : system 2x600 GB SATA RAID-1 + local scratch data 960 BB SATA SSD RAID-0 + controller H330
- 1x gigabit network port
- 1x infiniband FDR card
- hyperthreading active
- optimal performance with AVX-512, AVX/AVX2 support
Dell R940 node (2017)
Dell R940 quad-Xeon SP Gold 6148 @ 2.40GHz (80 cores) : 1 node ( nef053 )
- RAM capacity : 1024 GB RAM
- storage : system 2x600 GB SATA RAID-1 + local scratch data 1.92 TB SATA SSD + controller H740P
- 4x gigabit network ports (one connected)
- infiniband EDR card (connected to FDR switch)
- hyperthreading active
- optimal performance with AVX-512, AVX/AVX2 support
Dell C6420 cluster (2017)
Dell C6420 dual-Xeon Skylake SP Silver 4114 @ 2.20GHz (20 cores) : 16 nodes ( nef037 to nef052 )
- RAM capacity : 192 GB RAM
- 1x600GB 10kRPM SAS HardDisk drive
- 1x gigabit network port
- 1x infiniband FDR card
- hyperthreading active
- AVX-512 support, optimal performance with AVX/AVX2
Dell C6220 cluster (2015)
Dell C6220 dual-Xeon E5-2650 v2 @ 2.60GHz (16 cores) : 8 nodes ( nef029 to nef036 )
- RAM capacity : 256 GB RAM
- 1x1TB SATA HardDisk drive
- 2x gigabit network ports (one connected)
- 2x infiniband QDR card (one connected)
- hyperthreading not active
Dell C6220 cluster (2014)
Dell C6220 dual-Xeon E5-2680 v2 @ 2.80GHz (20 cores) : 16 nodes ( nef013 to nef028 )
- RAM capacity : 192 GB RAM
- 1x2TB SATA HardDisk drive
- 2x gigabit network ports (one connected)
- 1x infiniband FDR card (QDR used)
- hyperthreading not active
Dell C6145 cluster (2013)
Dell C6145 quad-Opterons 6376 @ 2.3Ghz (64 cores) : 6 nodes ( nef007 to nef012 )
- RAM capacity : 256 GB RAM (512GB on nef011 and nef012)
- 1x500GB SATA HardDisk drive
- 2x gigabit network ports (one connected)
- 1x infiniband QDR card
- hyperthreading not supported
Dell R815 cluster (2010)
Dell R815 quad-Opterons 6174 @ 2.2Ghz (48 cores) : 6 nodes ( nef001 to nef006 )
- RAM capacity : 256 GB RAM
- 2x600GB SAS HardDisk drive (RAID-0)
- 4x gigabit network ports (one connected)
- 1x infiniband QDR card
- hyperthreading not supported
GPU nodes
HPE DL385 GPU nodes (2023)
HPE DL385 dual-AMD EPYC 7662 @ 2.0GHz (64 cores) : 1 node ( nefgpu56 )
- Nvidia A100 PCIe GPUs cards
- 6912 CUDA cores per card
- 432 tensor cores per card
- 40GB of RAM capacity per card
- Tensor performance peak : 155.92 TFlops per card
- Simple precision performance peak: 77.97 Tflops per card
- Double precision performance peak: 19.49 Tflops per card
- 1555 GB/s GPU memory bandwidth with error correction (ECC)
- 2x gigabit network ports (one connected)
- infiniband EDR card (connected to FDR switch)
- hyperthreading active
Node name | Funding team | GPU cards | Node CPU | Node RAM | Node storage | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
nefgpu56 | STARS | 3x A100 | 2x AMD EPYC 7662 | 1024 GB | }
Dell R7525 GPU nodes (2020)Dell R7525 dual-AMD EPYC 7282 @ 2.8GHz (16 cores) : 2 nodes ( nefgpu52 to nefgpu53 )
SuperMicro 4029 node (2019)SuperMicro 4029GP-TVRT : 1 node (nefgpu41 )
Asus ESC8000 GPU node (2018)Asus ESC8000G4 / Carri HighServer nodes : 1 node (nefgpu40 )
Dell R740 GPU nodes (2019)Dell R740 nodes: dual-Xeon Skylake or CascadeLake SP : 5 nodes ( nefgpu42 to nefgpu46)
Dell T640 GPU nodes (2018-2019-2020)Dell T640 nodes: dual-Xeon Skylake or CascadeLake SP : 21 nodes ( nefgpu{24-39} and nefgpu{47-51})
Dell T630 GPU nodes (2016-2017)Dell T630 nodes: dual-Xeon E5-26xx : 17 nodes ( nefgpu07 to nefgpu23)
Dell R730 GPU node (2016)Dell R730 nodes: dual-Xeon E5-2623v4 @ 2.6 GHz : 1 node ( nefgpu01)
StorageAll nodes have access to common storage :
More details about quotas at: Disk space management |