AI in Northumberland

From Server rental store
Jump to navigation Jump to search

AI in Northumberland: Server Configuration Documentation

This document details the server configuration for the "AI in Northumberland" project, intended as a resource for new system administrators and developers contributing to the project. It outlines the hardware, software, and network setup required to support the AI workloads. This documentation assumes a basic understanding of Linux server administration and networking concepts.

Overview

The "AI in Northumberland" project utilizes a cluster of servers to process and analyze data related to local environmental monitoring, agricultural optimization, and tourism trends. The system is designed for scalability and high availability. The core infrastructure is hosted within a secure data center in Newcastle upon Tyne, with redundant network connections to ensure continuous operation. The project leverages a combination of bare-metal servers and virtual machines for flexibility and resource management.

Hardware Configuration

The server cluster consists of three primary node types: Master Nodes, Worker Nodes, and Storage Nodes.

Master Nodes: These nodes manage the cluster, schedule jobs, and monitor system health.

Worker Nodes: These nodes perform the intensive AI computations.

Storage Nodes: These nodes provide persistent storage for datasets and models.

Here's a breakdown of the hardware specifications:

Component Master Nodes (x2) Worker Nodes (x6) Storage Nodes (x3)
CPU Intel Xeon Gold 6248R (24 cores) Intel Xeon Gold 6338 (32 cores) Intel Xeon Silver 4310 (12 cores)
RAM 128 GB DDR4 ECC 256 GB DDR4 ECC 64 GB DDR4 ECC
Storage (OS) 512 GB NVMe SSD 1 TB NVMe SSD 1 TB HDD
Storage (Data) - 4 TB NVMe SSD (RAID 0) 60 TB HDD (RAID 6)
Network Interface 2 x 10 GbE 2 x 10 GbE 2 x 1 GbE

All servers utilize a redundant power supply and are housed in a temperature-controlled rack. Detailed hardware inventory information can be found on the Hardware Inventory Page.

Software Stack

The software stack is built around Ubuntu Server 22.04 LTS. We employ Docker and Kubernetes for containerization and orchestration, providing a consistent and reproducible environment.

The following software components are essential:

  • Operating System: Ubuntu Server 22.04 LTS
  • Containerization: Docker 24.0
  • Orchestration: Kubernetes 1.27
  • Programming Languages: Python 3.10, R 4.3
  • AI Frameworks: TensorFlow 2.12, PyTorch 2.0, scikit-learn 1.2
  • Database: PostgreSQL 15
  • Message Queue: RabbitMQ 3.9
  • Monitoring: Prometheus and Grafana (See Monitoring Dashboard Setup)

Here’s a summary of software versions on each node type:

Node Type Key Software Versions
Master Nodes Kubernetes 1.27, Prometheus, Grafana, PostgreSQL 15
Worker Nodes Docker 24.0, TensorFlow 2.12, PyTorch 2.0, scikit-learn 1.2, Python 3.10, R 4.3
Storage Nodes PostgreSQL 15, various data backup tools (see Data Backup Procedures)

We utilize a centralized package repository managed by APT Repository Configuration. Regular security updates are applied automatically via unattended upgrades.

Network Configuration

The server cluster is connected to the network via two redundant 10 GbE connections. A private network is used for communication between the servers within the cluster.

Here's a simplified network diagram:

Component IP Address Subnet Mask Gateway
Master Node 1 192.168.1.10 255.255.255.0 192.168.1.1
Master Node 2 192.168.1.11 255.255.255.0 192.168.1.1
Worker Node 1-6 192.168.1.20-192.168.1.25 255.255.255.0 192.168.1.1
Storage Node 1-3 192.168.1.30-192.168.1.32 255.255.255.0 192.168.1.1

DNS resolution is handled by an internal DNS server. Firewall rules are configured using `ufw` to restrict access to the servers. Refer to the Firewall Configuration Guide for detailed information. The project also utilizes a VPN for secure remote access (see VPN Access Instructions).


Security Considerations

Security is paramount. All servers are hardened according to CIS benchmarks. Regular vulnerability scans are performed. Access control is restricted to authorized personnel only. Data is encrypted both in transit and at rest. See the Security Policy Document for a comprehensive overview.

Future Expansion

The system is designed to scale horizontally. Additional worker nodes can be added as needed to increase processing capacity. We are also evaluating the use of GPUs to accelerate AI workloads (see GPU Integration Plan).


Main Page Data Sources Deployment Guide Troubleshooting API Documentation User Access Control Data Privacy Policy Disaster Recovery Plan Monitoring Dashboard Setup Firewall Configuration Guide VPN Access Instructions APT Repository Configuration Hardware Inventory Page Data Backup Procedures Security Policy Document GPU Integration Plan


Intel-Based Server Configurations

Configuration Specifications Benchmark
Core i7-6700K/7700 Server 64 GB DDR4, NVMe SSD 2 x 512 GB CPU Benchmark: 8046
Core i7-8700 Server 64 GB DDR4, NVMe SSD 2x1 TB CPU Benchmark: 13124
Core i9-9900K Server 128 GB DDR4, NVMe SSD 2 x 1 TB CPU Benchmark: 49969
Core i9-13900 Server (64GB) 64 GB RAM, 2x2 TB NVMe SSD
Core i9-13900 Server (128GB) 128 GB RAM, 2x2 TB NVMe SSD
Core i5-13500 Server (64GB) 64 GB RAM, 2x500 GB NVMe SSD
Core i5-13500 Server (128GB) 128 GB RAM, 2x500 GB NVMe SSD
Core i5-13500 Workstation 64 GB DDR5 RAM, 2 NVMe SSD, NVIDIA RTX 4000

AMD-Based Server Configurations

Configuration Specifications Benchmark
Ryzen 5 3600 Server 64 GB RAM, 2x480 GB NVMe CPU Benchmark: 17849
Ryzen 7 7700 Server 64 GB DDR5 RAM, 2x1 TB NVMe CPU Benchmark: 35224
Ryzen 9 5950X Server 128 GB RAM, 2x4 TB NVMe CPU Benchmark: 46045
Ryzen 9 7950X Server 128 GB DDR5 ECC, 2x2 TB NVMe CPU Benchmark: 63561
EPYC 7502P Server (128GB/1TB) 128 GB RAM, 1 TB NVMe CPU Benchmark: 48021
EPYC 7502P Server (128GB/2TB) 128 GB RAM, 2 TB NVMe CPU Benchmark: 48021
EPYC 7502P Server (128GB/4TB) 128 GB RAM, 2x2 TB NVMe CPU Benchmark: 48021
EPYC 7502P Server (256GB/1TB) 256 GB RAM, 1 TB NVMe CPU Benchmark: 48021
EPYC 7502P Server (256GB/4TB) 256 GB RAM, 2x2 TB NVMe CPU Benchmark: 48021
EPYC 9454P Server 256 GB RAM, 2x2 TB NVMe

Order Your Dedicated Server

Configure and order your ideal server configuration

Need Assistance?

⚠️ *Note: All benchmark scores are approximate and may vary based on configuration. Server availability subject to stock.* ⚠️