Server rental store

AI Education

AI Education Server Configuration

This document details the server configuration for the "AI Education" project, designed to support a suite of tools for learning and experimenting with Artificial Intelligence. This guide is intended for new system administrators and developers contributing to the platform. It covers hardware specifications, software stack, and key configuration details.

Overview

The AI Education server is built to provide a robust and scalable environment for users to access and utilize AI-related resources. The primary goals are to support interactive tutorials, code execution, and model training, all within a secure and manageable infrastructure. We utilize a distributed architecture to maximize performance and availability. See Server Architecture Overview for a broader context. This server is distinct from the Data Analysis Server and the Content Delivery Network.

Hardware Specifications

The core server utilizes the following hardware components. Redundancy is built in at multiple levels to ensure high availability.

Component Specification Quantity
CPU Intel Xeon Gold 6338 (32 cores, 64 threads) 2
RAM 256 GB DDR4 ECC Registered 1
Storage (OS/Boot) 500 GB NVMe SSD 1
Storage (Data) 8 x 4TB SAS HDD (RAID 6) 1 Array
Network Interface 10 Gigabit Ethernet 2
GPU NVIDIA A100 (80GB) 4

We also utilize a separate storage cluster detailed in the Storage Cluster Documentation. This cluster is accessed via NFS.

Software Stack

The AI Education server is built on a Linux foundation, utilizing a combination of open-source and commercially supported software.

Software Version Purpose
Operating System Ubuntu Server 22.04 LTS Base OS and System Management
Containerization Docker 24.0.5 Application Isolation and Deployment
Container Orchestration Kubernetes 1.27 Automating deployment, scaling, and management of containerized applications
Programming Languages Python 3.10, R 4.3.1 Core languages for AI development and scripting. See Supported Languages for details.
Machine Learning Frameworks TensorFlow 2.13, PyTorch 2.0, scikit-learn 1.3 Libraries for building and training AI models. Refer to Framework Compatibility.
Database PostgreSQL 15 Metadata storage and user data management. See Database Schema.
Web Server Nginx 1.25 Reverse proxy and load balancer. Configuration details are in Nginx Configuration.

Configuration Details

Several key configuration elements are critical to the operation of the AI Education server.

Network Configuration

⚠️ *Note: All benchmark scores are approximate and may vary based on configuration. Server availability subject to stock.* ⚠️