AI in Oxford: Server Configuration

This article describes the server configuration behind the "AI in Oxford" project: the hardware and software that run the artificial intelligence applications and research initiatives housed within the department. It is intended as a technical reference for system administrators and developers working with the platform, and it covers the configuration as of October 26, 2023. Refer to the Change Log for updates.

Overview

The "AI in Oxford" infrastructure is designed for high-throughput computing, large dataset storage, and rapid model training. It uses a hybrid architecture that combines on-premise servers with cloud resources. The core on-premise cluster is a network of interconnected servers, each specialized for a specific task. Security is a primary concern, with access controls and data encryption implemented throughout the system (see Security Considerations). The system is monitored with Nagios (see Nagios Monitoring) and integrated with Incident Management.
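To illustrate how a host check might plug into Nagios, here is a minimal plugin sketch in Python. It follows the standard Nagios plugin convention of exit codes 0 (OK), 1 (WARNING), 2 (CRITICAL), and 3 (UNKNOWN) plus a single status line with optional performance data. The 80% and 90% thresholds, the function names, and the choice of a disk-usage check are assumptions for illustration only; the article does not describe the checks actually deployed.

```python
import shutil
import sys

# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3
LABELS = {OK: "OK", WARNING: "WARNING", CRITICAL: "CRITICAL", UNKNOWN: "UNKNOWN"}


def classify(percent_used: float, warn: float = 80.0, crit: float = 90.0) -> int:
    """Map a usage percentage to a Nagios status code (thresholds are hypothetical)."""
    if percent_used >= crit:
        return CRITICAL
    if percent_used >= warn:
        return WARNING
    return OK


def check_disk(path: str = "/", warn: float = 80.0, crit: float = 90.0) -> tuple[int, str]:
    """Return (exit_code, status_line) for the filesystem containing `path`."""
    try:
        usage = shutil.disk_usage(path)
    except OSError as exc:
        return UNKNOWN, f"DISK UNKNOWN - {exc}"
    if usage.total == 0:
        return UNKNOWN, f"DISK UNKNOWN - {path} reports zero capacity"
    percent = 100.0 * usage.used / usage.total
    code = classify(percent, warn, crit)
    # Text after the pipe is Nagios performance data: label=value;warn;crit
    line = (
        f"DISK {LABELS[code]} - {percent:.1f}% used on {path}"
        f" | used_pct={percent:.1f}%;{warn:g};{crit:g}"
    )
    return code, line


if __name__ == "__main__":
    exit_code, status_line = check_disk(sys.argv[1] if len(sys.argv) > 1 else "/")
    print(status_line)
    sys.exit(exit_code)
```

Nagios reads the exit code to decide the service state and displays the printed line, so a script like this could be registered as a command and run against each node's local NVMe mount.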

Hardware Specifications

The primary compute nodes are based on the following specifications:

Component | Specification | Quantity
CPU | Intel Xeon Gold 6338 (32 cores, 64 threads) | 16
RAM | 256 GB DDR4 ECC Registered RAM | 16
GPU | NVIDIA A100 (80GB) | 8
Storage (Local) | 4 TB NVMe PCIe Gen4 SSD | 16
Network Interface | 2 x 100 GbE Mellanox ConnectX-6 | 16
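As a quick sanity check on the compute-node figures, the following Python sketch multiplies out the listed quantities. It assumes the Quantity column counts individual components across the whole cluster. The table does not say whether the figures are per node, so treat the totals as illustrative rather than authoritative.

```python
# Figures copied from the compute-node table.
# Assumption: each Quantity is a cluster-wide count of individual components.
CPU_COUNT, CORES_PER_CPU, THREADS_PER_CPU = 16, 32, 64
RAM_UNIT_GB, RAM_COUNT = 256, 16
GPU_COUNT, GPU_MEM_GB = 8, 80
NVME_TB, NVME_COUNT = 4, 16


def compute_totals() -> dict[str, int]:
    """Aggregate capacity of the on-premise compute cluster."""
    return {
        "cpu_cores": CPU_COUNT * CORES_PER_CPU,
        "cpu_threads": CPU_COUNT * THREADS_PER_CPU,
        "ram_gb": RAM_COUNT * RAM_UNIT_GB,
        "gpu_memory_gb": GPU_COUNT * GPU_MEM_GB,
        "local_nvme_tb": NVME_COUNT * NVME_TB,
    }


if __name__ == "__main__":
    for name, value in compute_totals().items():
        print(f"{name}: {value}")
```

Under that assumption the cluster offers 512 physical cores, 4096 GB of RAM, and 640 GB of GPU memory in total.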

Storage is handled by a dedicated network-attached storage (NAS) cluster.

Component | Specification | Quantity
Storage Type | Seagate Exos X18 18TB SAS HDD | 64
RAID Level | RAID 6 | N/A
File System | ZFS | N/A
Network Interface | 4 x 40 GbE Mellanox ConnectX-5 | 2
Total Usable Capacity | ~800 TB | N/A
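The usable-capacity figure can be roughly cross-checked from the drive count and RAID level. The Python sketch below estimates capacity for double-parity groups of a given size. The grouping of 8 drives per RAID group is a hypothetical layout chosen for illustration, since the table does not state how the 64 drives are arranged, and the estimate ignores file system overhead, spares, and reserved space.

```python
DRIVE_TB = 18          # Seagate Exos X18, decimal terabytes
DRIVE_COUNT = 64
PARITY_PER_GROUP = 2   # RAID 6 keeps two parity drives per group


def usable_tb(drives_per_group: int) -> int:
    """Capacity in TB left after parity, for equal-sized RAID 6 groups."""
    if drives_per_group <= PARITY_PER_GROUP:
        raise ValueError("a RAID 6 group needs more than two drives")
    if DRIVE_COUNT % drives_per_group != 0:
        raise ValueError("drive count must divide evenly into groups")
    groups = DRIVE_COUNT // drives_per_group
    data_drives = groups * (drives_per_group - PARITY_PER_GROUP)
    return data_drives * DRIVE_TB


def tb_to_tib(tb: float) -> float:
    """Convert decimal terabytes (10**12 bytes) to binary tebibytes (2**40 bytes)."""
    return tb * 10**12 / 2**40


if __name__ == "__main__":
    raw = DRIVE_COUNT * DRIVE_TB
    est = usable_tb(8)  # hypothetical layout: 8 groups of 8 drives
    print(f"raw: {raw} TB, after parity: {est} TB ({tb_to_tib(est):.0f} TiB)")
```

With that assumed layout, 1152 TB of raw capacity leaves 864 TB after parity, or about 786 TiB, which is in line with the stated ~800 TB once overhead and unit conventions are taken into account.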

Finally, the front-end servers are slightly less powerful, serving as access points and managing user authentication.

Component | Specification | Quantity
CPU | Intel Xeon E-2388G (8 cores, 16 threads) | 4
RAM | 64 GB DDR4 ECC Registered RAM | 4
Storage (Local) | 1 TB NVMe PCIe Gen3 SSD | 4
Network Interface | 2 x 10 GbE Intel X710 | 4

Software Stack

The servers run a customized distribution of Ubuntu Server 22.04. The core software components include:
