AI in the Cambodian Rainforest

From Server rental store
Jump to navigation Jump to search
  1. AI in the Cambodian Rainforest: Server Configuration

This document details the server configuration used to support the "AI in the Cambodian Rainforest" project. This project utilizes artificial intelligence for real-time analysis of audio and visual data collected from remote sensors deployed within the Cardamom Mountains of Cambodia. The goal is to monitor biodiversity, detect illegal logging, and track wildlife populations. This guide is intended for new system administrators and developers contributing to the project.

Overview

The server infrastructure is designed for high availability, scalability, and data security. It is a hybrid architecture, utilizing both on-premise servers for immediate data processing and cloud-based resources for long-term storage and complex model training. The system relies heavily on Python for data processing and machine learning tasks, with a PostgreSQL database for storing metadata and analysis results. We use Docker extensively for containerization and deployment. The entire system is monitored using Nagios.

Hardware Specifications

The on-premise servers are located in a secure, climate-controlled facility in Phnom Penh, Cambodia. These servers are the primary interface with the sensor network and perform initial data filtering and analysis.

Component Specification Quantity
CPU Intel Xeon Gold 6248R (24 cores, 3.0 GHz) 2
RAM 256 GB DDR4 ECC Registered 2
Storage (OS & Apps) 2 x 1TB NVMe SSD (RAID 1) 2
Storage (Temporary Data) 8 x 4TB SATA HDD (RAID 6) 2
Network Interface Dual 10 Gigabit Ethernet 2
Power Supply Redundant 1200W 80+ Platinum 2

Software Stack

The software stack is built around a Linux operating system and a set of open-source tools. We utilize Ubuntu Server 22.04 LTS as our base operating system.

Software Version Purpose
Operating System Ubuntu Server 22.04 LTS Base operating system
Database PostgreSQL 14 Metadata storage and analysis results
Programming Language Python 3.10 Data processing, machine learning
Web Server Nginx 1.22 Serving API endpoints and web interface
Containerization Docker 20.10 Application deployment and isolation
Message Queue RabbitMQ 3.9 Asynchronous task processing
Machine Learning Framework TensorFlow 2.9 Deep learning model execution
Data Visualization Grafana 8.5 Monitoring and data visualization

Network Configuration

The network is segmented into three zones: a public zone for external access, a DMZ for the web server and API endpoints, and a private zone for the database and data processing servers. Firewall rules are implemented to restrict access between zones. We utilize a VPN for secure remote access.

Zone IP Range Access Control
Public 203.0.113.0/24 Limited to specific ports (80, 443)
DMZ 192.168.1.0/24 Access from Public zone via firewall
Private 10.0.0.0/24 Access from DMZ zone via firewall, restricted to specific services

Data Flow

1. Sensor data is transmitted wirelessly to a local gateway. 2. The gateway forwards the data to the on-premise servers via a secure network connection. 3. On-premise servers perform initial data filtering and preprocessing. 4. Filtered data is transmitted to the cloud for long-term storage and complex analysis using Amazon S3. 5. Machine learning models are trained in the cloud using Amazon SageMaker. 6. Trained models are deployed to the on-premise servers for real-time inference. 7. Analysis results are stored in the PostgreSQL database. 8. Data visualizations are generated using Grafana.

Security Considerations

  • All data transmissions are encrypted using TLS.
  • Regular security audits are conducted to identify and address vulnerabilities.
  • Access control lists (ACLs) are used to restrict access to sensitive data.
  • Intrusion detection and prevention systems are in place to monitor for malicious activity.
  • Regular backups are performed to protect against data loss. We use rsync for efficient backups.
  • The system is hardened according to CIS benchmarks.

Future Enhancements

  • Implement a distributed processing framework like Apache Spark for faster data analysis.
  • Explore the use of edge computing to perform more analysis locally at the sensor gateways.
  • Integrate with a GIS system for spatial data analysis.
  • Automate the deployment process using Ansible.

Main Page Server Administration Data Analysis Machine Learning Network Security Database Management Cloud Computing System Monitoring Troubleshooting Documentation Glossary Contact Us Privacy Policy Terms of Service


Intel-Based Server Configurations

Configuration Specifications Benchmark
Core i7-6700K/7700 Server 64 GB DDR4, NVMe SSD 2 x 512 GB CPU Benchmark: 8046
Core i7-8700 Server 64 GB DDR4, NVMe SSD 2x1 TB CPU Benchmark: 13124
Core i9-9900K Server 128 GB DDR4, NVMe SSD 2 x 1 TB CPU Benchmark: 49969
Core i9-13900 Server (64GB) 64 GB RAM, 2x2 TB NVMe SSD
Core i9-13900 Server (128GB) 128 GB RAM, 2x2 TB NVMe SSD
Core i5-13500 Server (64GB) 64 GB RAM, 2x500 GB NVMe SSD
Core i5-13500 Server (128GB) 128 GB RAM, 2x500 GB NVMe SSD
Core i5-13500 Workstation 64 GB DDR5 RAM, 2 NVMe SSD, NVIDIA RTX 4000

AMD-Based Server Configurations

Configuration Specifications Benchmark
Ryzen 5 3600 Server 64 GB RAM, 2x480 GB NVMe CPU Benchmark: 17849
Ryzen 7 7700 Server 64 GB DDR5 RAM, 2x1 TB NVMe CPU Benchmark: 35224
Ryzen 9 5950X Server 128 GB RAM, 2x4 TB NVMe CPU Benchmark: 46045
Ryzen 9 7950X Server 128 GB DDR5 ECC, 2x2 TB NVMe CPU Benchmark: 63561
EPYC 7502P Server (128GB/1TB) 128 GB RAM, 1 TB NVMe CPU Benchmark: 48021
EPYC 7502P Server (128GB/2TB) 128 GB RAM, 2 TB NVMe CPU Benchmark: 48021
EPYC 7502P Server (128GB/4TB) 128 GB RAM, 2x2 TB NVMe CPU Benchmark: 48021
EPYC 7502P Server (256GB/1TB) 256 GB RAM, 1 TB NVMe CPU Benchmark: 48021
EPYC 7502P Server (256GB/4TB) 256 GB RAM, 2x2 TB NVMe CPU Benchmark: 48021
EPYC 9454P Server 256 GB RAM, 2x2 TB NVMe

Order Your Dedicated Server

Configure and order your ideal server configuration

Need Assistance?

⚠️ *Note: All benchmark scores are approximate and may vary based on configuration. Server availability subject to stock.* ⚠️