Server rental store

AI in Poole

# AI in Poole: Server Configuration

This document details the server configuration for the "AI in Poole" project, outlining hardware, software, and networking details. This is intended as a guide for new administrators and developers working with the system. This project utilizes a distributed computing model to support the intensive processing requirements of large language models. Please refer to Main Page for project overview.

Overview

The "AI in Poole" infrastructure consists of a cluster of servers located in a dedicated data center in Poole, UK. The primary function of these servers is to host and operate large language models, providing API access for various applications. The cluster is designed for high availability, scalability, and performance. See System Architecture for a diagram of the overall system. This project heavily leverages Docker containers for environment isolation and reproducibility. Regular backups are performed using Backup Procedures.

Hardware Specifications

The server cluster comprises three main types of nodes: Master Nodes, Compute Nodes, and Storage Nodes. Each node type has specific hardware requirements.

Node Type CPU RAM Storage Network Interface
Master Node | 2x Intel Xeon Gold 6338 | 128 GB DDR4 ECC | 2x 1 TB NVMe SSD (RAID 1) | 10 Gbps Ethernet |
Compute Node | 2x AMD EPYC 7763 | 256 GB DDR4 ECC | 4x 4 TB NVMe SSD (RAID 0) | 100 Gbps InfiniBand |
Storage Node | 2x Intel Xeon Silver 4310 | 64 GB DDR4 ECC | 8x 16 TB SATA HDD (RAID 6) | 10 Gbps Ethernet |

These specifications are subject to change as the project evolves. Refer to Hardware Inventory for the most up-to-date listing of individual server details. Power consumption is monitored via Power Monitoring System.

Software Stack

The "AI in Poole" servers run a customized Linux distribution based on Ubuntu 22.04 LTS. Key software components include:

⚠️ *Note: All benchmark scores are approximate and may vary based on configuration. Server availability subject to stock.* ⚠️