4 Hosts
8 GPUs
60 Pods
GPU Compute

NixOS Homelab
Infrastructure

A 4-host Kubernetes cluster with GPU compute, distributed storage, and GPU-accelerated AI inference workloads.

Infrastructure Meets Cloud Native

The Challenge

The Solution

78 CPU Cores
123GB RAM
8 GPUs
8.4TB Storage

Infrastructure Innovation

NFS Config Sync

Profile System

More than 50 Justfile commands, organized by profile (desktop, gaming, k8s). Run 'just desktop' to rebuild the desktop environment or 'just k8s' to manage the cluster.

Custom NixOS Modules

A local flake with reusable modules: llm.nix (vLLM integration) and cluster.nix (K8s setup). Composable infrastructure as code.
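To make the idea of a reusable module concrete, here is a minimal sketch of what a module like llm.nix could look like. It is not the actual file: the `homelab.llm` option namespace, the option names, and the systemd unit name are all assumptions for illustration. Only the module-system helpers (`mkEnableOption`, `mkOption`, `mkIf`) and the `vllm serve <model> --port <port>` command line are standard.

```nix
# Hypothetical llm.nix sketch; option names are illustrative, not the author's.
{ config, lib, ... }:

let
  cfg = config.homelab.llm; # assumed option namespace
in
{
  options.homelab.llm = {
    enable = lib.mkEnableOption "local vLLM inference server";

    package = lib.mkOption {
      type = lib.types.package;
      description = "Package that provides the vllm executable.";
    };

    model = lib.mkOption {
      type = lib.types.str;
      example = "org/model-name";
      description = "Model identifier passed to 'vllm serve'.";
    };

    port = lib.mkOption {
      type = lib.types.port;
      default = 8000;
      description = "Port for the OpenAI-compatible HTTP API.";
    };
  };

  # Nothing below is evaluated into the system unless enable = true.
  config = lib.mkIf cfg.enable {
    systemd.services.vllm = {
      description = "vLLM inference server";
      wantedBy = [ "multi-user.target" ];
      wants = [ "network-online.target" ];
      after = [ "network-online.target" ];
      serviceConfig = {
        ExecStart = "${cfg.package}/bin/vllm serve ${cfg.model} --port ${toString cfg.port}";
        Restart = "on-failure";
      };
    };
  };
}
```

A host would then list the module in its `imports` and set `homelab.llm.enable = true;` along with a `package` and `model`. Hosts without a GPU can import the same module and leave it disabled, which is what makes the modules composable.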

Operations & Automation

Automated Deployments

NixOS rebuilds automatically restart services whose units changed. Kubernetes deployments are applied from GitOps-style manifests.
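One detail worth illustrating: a rebuild restarts a service only when its unit changes, so a change to a config file the unit does not reference directly can go unnoticed. The sketch below uses the standard `restartTriggers` option to tie a service to a generated file. The service name, the file path, its contents, and the stand-in command are placeholders, not taken from this setup.

```nix
{ config, pkgs, ... }:

{
  # Placeholder config file rendered by Nix into /etc.
  environment.etc."gateway/config.json".text = builtins.toJSON {
    listen = "0.0.0.0:8080";
    cacheTtlSeconds = 300;
  };

  systemd.services.gateway = {
    description = "Example service restarted when its config changes";
    wantedBy = [ "multi-user.target" ];

    # The store path of the generated file is embedded in the unit, so any
    # change to the file changes the unit and causes a restart on rebuild.
    restartTriggers = [ config.environment.etc."gateway/config.json".source ];

    serviceConfig = {
      # Stand-in command; replace with the real service binary.
      ExecStart = "${pkgs.python3}/bin/python3 -m http.server 8080";
      Restart = "on-failure";
    };
  };
}
```

With this in place, editing the JSON attributes and running a rebuild restarts the service without any manual `systemctl restart`.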

Service Mesh

Flannel CNI for pod networking, CoreDNS for service discovery, and StatefulSets for databases. Prometheus and Grafana provide monitoring and alerting.
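For readers who want to see how Flannel and CoreDNS can be declared on NixOS, the following sketch uses the `services.kubernetes` module from nixpkgs. Whether this cluster uses that module, k3s, or kubeadm is not stated, so treat it as one possible approach. The IP address, hostname, and port are placeholder values.

```nix
{ pkgs, ... }:

let
  # Placeholder values; adjust to the real control-plane host.
  masterIP = "10.0.0.10";
  masterHostname = "api.kube.local";
  apiPort = 6443;
in
{
  networking.extraHosts = "${masterIP} ${masterHostname}";

  environment.systemPackages = [ pkgs.kubectl ];

  services.kubernetes = {
    # A single host acting as both control plane and worker.
    roles = [ "master" "node" ];
    masterAddress = masterHostname;
    apiserverAddress = "https://${masterHostname}:${toString apiPort}";
    easyCerts = true; # let the module generate cluster certificates

    apiserver = {
      securePort = apiPort;
      advertiseAddress = masterIP;
    };

    flannel.enable = true;    # pod networking via Flannel
    addons.dns.enable = true; # CoreDNS for in-cluster service discovery
  };
}
```

Additional worker hosts would use `roles = [ "node" ];` and point `masterAddress` at the same control-plane hostname.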

Observability

Centralized logging with Promtail, metrics collection with Prometheus, and visualization in Grafana. Alertmanager routes alerts for incident response.
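The stack described here may well run inside the cluster. As a host-level illustration of the same metrics path (exporter, scrape, dashboard), this sketch uses the NixOS Prometheus and Grafana modules. It is an assumption that any of it runs outside Kubernetes, and the ports and scrape targets are placeholders.

```nix
{ ... }:

{
  services.prometheus = {
    enable = true;
    port = 9090;

    # Node exporter publishes host metrics (CPU, memory, disks, systemd units).
    exporters.node = {
      enable = true;
      port = 9100;
      enabledCollectors = [ "systemd" ];
    };

    # Scrape the local exporter; add one target per host in a real cluster.
    scrapeConfigs = [
      {
        job_name = "node";
        static_configs = [
          { targets = [ "localhost:9100" ]; }
        ];
      }
    ];
  };

  services.grafana = {
    enable = true;
    settings.server = {
      http_addr = "127.0.0.1";
      http_port = 3000;
    };
  };
}
```

Grafana then needs a Prometheus data source pointing at `http://localhost:9090`, which can be added in the UI or provisioned declaratively.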

Cluster Capabilities

Hardware Infrastructure

78 CPU cores, 123GB RAM, 8 GPUs across 4 hosts, 8.4TB storage.

AI/ML Services

n8n for AI workflows, Qdrant for vector search, vLLM for local LLM inference, and an OpenAI-compatible API gateway with middleware (rate limiting, circuit breaker, Redis caching).

GPU Compute

Status: GPU Compute Ready
GPUs:
  NVIDIA (8GB) - available
  NVIDIA (8GB) - available
  NVIDIA (8GB) - available
  NVIDIA (24GB) - available
  NVIDIA (8GB) - available
Storage: HDD, NVMe
Ready for: AI inference workloads

Infrastructure Timeline

The Journey

1
Before Sept 2025

Windows + Proxmox

Dual-boot setup with Proxmox servers for testing, Windows as daily driver

🎯
September 2025

Killed Windows

Full commitment to Linux. Started distro hopping journey.

3
Sept 2025 - Feb 2026

OS Evolution

Omarchy (Arch-based) → CachyOS (optimized Arch) → NixOS (declarative + reproducible)

4
March 2, 2026

NixOS Initial Commit

First NixOS configuration. Single host with basic desktop + gaming

5
March 3, 2026

AI Gateway v1.0

OpenAI-compatible API, multi-GPU support

6
March 4, 2026

Gateway v2.0

Middleware architecture with circuit breaker, rate limiting, Redis caching

7
Mid-March 2026

Cluster Expansion

Added 3 more nodes. Implemented NFS config sync, profile system, 50+ Justfile commands

8
March 18, 2026

K8s Phase 1-3: Foundation

Control plane, Flannel CNI, CoreDNS, stateful services (GlitchTip PostgreSQL)

9
March 19, 2026

K8s Phase 4-5: Services & GPU

Stateless services (GlitchTip web/worker, SearXNG, n8n), GPU workloads (llama.cpp)

10
March 20, 2026

K8s Phase 6-7: Monitoring & GPU

Prometheus + Grafana monitoring, GPU compute workloads with 5 GPUs ready

🚀
March 21, 2026

Production Cluster Live

60+ pods running across 4 hosts. AI inference, monitoring, and GPU compute fully operational

Code Explorer

Code Patterns

Overview.nix
# NixOS Configuration Pattern
# A module is a function; pkgs must be taken as an argument to be used below.
{ pkgs, ... }:

{
  # Declarative system configuration
  boot.loader.systemd-boot.enable = true;

  # Network setup
  networking.networkmanager.enable = true;

  # User management
  users.users.jkro = {
    isNormalUser = true;
    extraGroups = [ "wheel" "networkmanager" ];
  };

  # System packages
  environment.systemPackages = with pkgs; [
    vim git curl wget
  ];
}