# AI in Norwich: Server Configuration

This document details the server configuration for the "AI in Norwich" project, a local initiative dedicated to exploring the applications of Artificial Intelligence within the city. This guide is aimed at new contributors to the wiki and provides a comprehensive overview of the hardware and software employed. Please read carefully before making any modifications.

## Overview

The "AI in Norwich" project uses a cluster of servers located within the Norwich Research Park data centre. The servers are primarily used for machine learning model training, data analysis, and hosting web-based AI applications. The system is designed for scalability and redundancy, allowing for future growth and minimizing downtime. This article covers only the core infrastructure; Data Security Protocols and Network Topology are documented in separate articles.

## Hardware Configuration

The cluster consists of three primary server nodes, designated Node-A, Node-B, and Node-C. All three nodes share the same specification to ensure consistency and simplify maintenance. A dedicated storage server handles data persistence.

| Server Node | CPU | RAM | Storage | Network Interface |
|---|---|---|---|---|
| Node-A | Intel Xeon Gold 6248R (24 cores) | 256 GB DDR4 ECC Registered | 2 x 4 TB NVMe SSD (RAID 1) | 10 Gigabit Ethernet |
| Node-B | Intel Xeon Gold 6248R (24 cores) | 256 GB DDR4 ECC Registered | 2 x 4 TB NVMe SSD (RAID 1) | 10 Gigabit Ethernet |
| Node-C | Intel Xeon Gold 6248R (24 cores) | 256 GB DDR4 ECC Registered | 2 x 4 TB NVMe SSD (RAID 1) | 10 Gigabit Ethernet |

The storage server, designated 'Storage-1', provides centralized storage for all nodes.

| Component | Specification |
|---|---|
| Host Name | Storage-1 |
| CPU | Intel Xeon Silver 4210 (10 cores) |
| RAM | 64 GB DDR4 ECC Registered |
| Storage | 8 x 16 TB SAS HDD (RAID 6) |
| Network Interface | 10 Gigabit Ethernet |
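The RAID levels listed above determine how much of the raw disk space is actually usable. The following is a minimal sketch of that arithmetic, using only the drive counts, drive sizes, and per-node figures from the tables. The `RaidArray` class and its field names are illustrative, not part of any project tooling, and the results are nominal figures that ignore filesystem and formatting overhead.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RaidArray:
    """A RAID array described by drive count, per-drive size, and level."""
    drives: int
    drive_tb: int
    level: int  # only RAID 1 and RAID 6 are handled in this sketch

    def usable_tb(self) -> int:
        """Return nominal usable capacity in TB, ignoring filesystem overhead."""
        if self.level == 1:
            # Mirroring: every drive holds the same data.
            return self.drive_tb
        if self.level == 6:
            # Dual parity: two drives' worth of space is used for parity.
            return (self.drives - 2) * self.drive_tb
        raise ValueError(f"unsupported RAID level: {self.level}")


# Figures taken from the hardware tables.
node_array = RaidArray(drives=2, drive_tb=4, level=1)      # per compute node
storage_array = RaidArray(drives=8, drive_tb=16, level=6)  # Storage-1

NODE_COUNT = 3
CORES_PER_NODE = 24
RAM_GB_PER_NODE = 256

total_cores = NODE_COUNT * CORES_PER_NODE
total_ram_gb = NODE_COUNT * RAM_GB_PER_NODE

print(f"Usable local storage per node: {node_array.usable_tb()} TB")
print(f"Usable capacity on Storage-1:  {storage_array.usable_tb()} TB")
print(f"Cluster compute totals:        {total_cores} cores, {total_ram_gb} GB RAM")
```

Under these assumptions each node has 4 TB of mirrored local storage, Storage-1 offers 96 TB of its 128 TB raw capacity, and the three nodes together provide 72 cores and 768 GB of RAM.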

Power is supplied via redundant power supplies and a dedicated UPS system, detailed in the Power Management Documentation.

## Software Configuration

All server nodes run Ubuntu Server 22.04 LTS. The software stack is designed to facilitate machine learning and data science workflows. The primary software components include Python 3.10, TensorFlow 2.12, PyTorch 2.0, and JupyterLab. A distributed file system, GlusterFS, is used to provide a unified namespace across the cluster.

| Software Component | Version | Purpose |
|---|---|---|
| Operating System | Ubuntu Server 22.04 LTS | Server Operating System |
| Python | 3.10.6 | Primary Programming Language |
| TensorFlow | 2.12.0 | Machine Learning Framework |
| PyTorch | 2.0.1 | Machine Learning Framework |
| JupyterLab | 3.5.0 | Interactive Development Environment |
| GlusterFS | 10.1 | Distributed File System |
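Contributors who want to confirm that a node's Python environment matches the versions listed above could use something like the sketch below. It relies only on the standard-library `importlib.metadata` module. The distribution names (`tensorflow`, `torch`, `jupyterlab`) are assumptions, since this article does not say how the packages were installed, and `check_versions` is a hypothetical helper rather than an existing project script.

```python
from importlib import metadata

# Versions from the software table. The PyPI distribution names used as
# keys are assumptions; the wiki does not say how the packages were installed.
EXPECTED = {
    "tensorflow": "2.12.0",
    "torch": "2.0.1",
    "jupyterlab": "3.5.0",
}


def installed_version(dist_name):
    """Return the installed version of a distribution, or None if absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def check_versions(expected, lookup=installed_version):
    """Map each distribution name to 'ok', 'missing', or a mismatch note."""
    report = {}
    for name, wanted in expected.items():
        found = lookup(name)
        if found is None:
            report[name] = "missing"
        elif found == wanted:
            report[name] = "ok"
        else:
            report[name] = f"mismatch: found {found}, expected {wanted}"
    return report


if __name__ == "__main__":
    for name, status in check_versions(EXPECTED).items():
        print(f"{name}: {status}")
```

The comparison is an exact string match, so a build that carries a local version suffix (some PyTorch wheels report a version such as `2.0.1+cu117`) would be flagged as a mismatch. Whether to treat such builds as equivalent is left open here.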

## Network Configuration

The servers are connected via a dedicated 10 Gigabit Ethernet network, and each node is assigned a static IP address within the 192.168.1.0/24 subnet. Internal DNS resolution is managed by a local BIND server. Firewall rules, configured with `ufw`, permit access only to essential services. Refer to the Network Security Policy for detailed information.
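Before assigning a static address, it can help to check that it is a usable host address in the cluster subnet. The sketch below does this with the standard-library `ipaddress` module. The addresses in `EXAMPLE_ADDRESSES` are hypothetical and used only for illustration, as the real assignments are not listed in this article; `is_valid_host_address` is likewise an illustrative helper.

```python
import ipaddress

CLUSTER_SUBNET = ipaddress.ip_network("192.168.1.0/24")

# Hypothetical assignments for illustration only; the real addresses are
# not listed in this article.
EXAMPLE_ADDRESSES = {
    "Node-A": "192.168.1.11",
    "Node-B": "192.168.1.12",
    "Node-C": "192.168.1.13",
    "Storage-1": "192.168.1.20",
}


def is_valid_host_address(address, subnet=CLUSTER_SUBNET):
    """Return True if address is a usable host address inside subnet."""
    ip = ipaddress.ip_address(address)
    # Exclude the network and broadcast addresses, which cannot be hosts.
    return (
        ip in subnet
        and ip != subnet.network_address
        and ip != subnet.broadcast_address
    )


for host, address in EXAMPLE_ADDRESSES.items():
    status = "ok" if is_valid_host_address(address) else "not a usable host address"
    print(f"{host}: {address} -> {status}")
```

A /24 subnet leaves 254 usable host addresses (192.168.1.1 to 192.168.1.254), so the check rejects 192.168.1.0 and 192.168.1.255 as well as anything outside the subnet.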
