Air-Gapped Deployment

Fully isolated infrastructure with no internet connectivity. Every component — models, data, inference, tooling — runs inside a closed network with no external access.

System Component

Security Concern

Air-Gap Boundary

Physical Transfer

What Air-Gapped Means

An air-gapped deployment has no network connection to the outside world. No internet, no cloud APIs, no external DNS. Everything the system needs — models, data, updates — enters through controlled physical or one-way transfer mechanisms.

No outbound connections

The system cannot reach external APIs, model providers, package registries, or any internet endpoint. All inference happens locally on hardware you control.

No inbound connections

Nothing from the outside can reach the system. No webhooks, no remote management, no cloud sync. Access is physical or through a tightly controlled jump host on the isolated network.

Physical transfer only

Model weights, software updates, and data enter via verified physical media — encrypted USB drives, optical media, or hardware security modules with strict chain-of-custody procedures.

Security Surface

Air-gapping eliminates entire categories of network-based attacks but introduces unique operational security challenges. The attack surface shifts from network to physical and supply chain.

HIGH

Supply Chain — Model Integrity

Model weights must be downloaded externally and transferred in. If the transfer media is compromised — poisoned weights, backdoored model files, tampered checksums — the entire system is built on a corrupted foundation with no way to phone home for verification.

Mitigation: Multi-party verification of model checksums, cryptographic signing of transfer media, dedicated secure download workstation, model scanning tools run pre-transfer.

HIGH

Supply Chain — Software Updates

All software — OS patches, framework updates, dependency upgrades — must be transferred manually. A compromised update package that passes verification could introduce vulnerabilities with no network-based detection or rollback.

Mitigation: Signed package repositories mirrored internally, staged update testing on isolated test environment, binary reproducibility checks, vulnerability scanning pre-transfer.

HIGH

Insider Threat

With no network exfiltration path, the primary data loss vector becomes authorised personnel. Anyone with physical access to the air-gapped environment can potentially extract data via removable media, modified hardware, or covert channels.

Mitigation: Strict physical access controls, USB port disabling (hardware-level), removable media logging, two-person integrity for data transfers, TEMPEST-rated facilities where required.

MEDIUM

Prompt Injection

Open-source models in air-gapped environments have no provider-side guardrails. Prompt injection attacks may be more effective, and there's no ability to leverage cloud-based safety APIs or real-time model updates to patch vulnerabilities.

Mitigation: Local guardrail models (Llama Guard), input sanitisation layers, output validation pipelines, system prompt hardening, regular guardrail model updates via secure transfer.

MEDIUM

Stale Models & Data

Without internet access, models can't be updated automatically. Knowledge cutoffs become hard limits. Security patches for inference servers, frameworks, and dependencies require manual transfer — creating windows of known vulnerability.

Mitigation: Scheduled update cadence (weekly/monthly secure transfers), internal vulnerability tracking, compensating controls for known unpatched issues, model refresh schedule.

LOW

Covert Channel Exfiltration

Advanced adversaries may attempt data exfiltration through side channels — electromagnetic emissions, acoustic signals, power line modulation, or timing attacks on shared resources.

Mitigation: TEMPEST shielding where required, physical isolation of compute resources, power line filtering, acoustic dampening, regular security sweeps.

Architecture Considerations

Model Selection

Only open-source or licensed models that can be downloaded and transferred. No cloud-only models (Claude, GPT, Gemini) are available in a true air-gapped environment. Llama, Mistral, Qwen, and DeepSeek are the primary options.

Common choices: Llama 3 (70B/8B), Mistral Large, Qwen 2.5, DeepSeek-V3, CodeLlama for code tasks.

Inference Infrastructure

Self-hosted inference servers (vLLM, Ollama, TGI) running on local GPU clusters. All compute is on-premise. You size hardware for peak load since there's no cloud burst capacity.

Typical: NVIDIA A100/H100 or AMD MI300X clusters, with vLLM for throughput or Ollama for simplicity.

Internal Package Mirror

A local mirror of all required package registries — PyPI, npm, container registries, model repositories. Updated via secure transfer on a defined schedule. No pip install from the internet.

Tools: Artifactory, Nexus, private PyPI, Harbor for containers, internal Hugging Face mirror.

Data Ingestion

All external data enters through controlled transfer points with inspection, scanning, and validation. Internal data pipelines operate normally within the air-gapped network. RAG knowledge bases are populated via secure bulk transfers.

Pattern: Secure transfer workstation → scanning → staging → production import.

Monitoring & Logging

All observability stays internal. No cloud logging services, no external alerting. You run your own monitoring stack — Prometheus, Grafana, ELK, or equivalent — entirely on the isolated network.

Stack: Prometheus + Grafana for metrics, Loki or Elasticsearch for logs, local alerting.

Update Cadence

Establish a regular secure transfer schedule for model updates, security patches, and knowledge base refreshes. The gap between external availability and internal deployment is a managed risk.

Typical: Security patches weekly, model updates monthly, full system refresh quarterly.

Why Air-Gapped?

Maximum Data Protection

No data can leave the network — period. No accidental API calls, no telemetry, no DNS leaks. The strongest guarantee of data sovereignty available.

Regulatory Compliance

Required for classified government systems, certain healthcare environments, financial trading systems, and any context where data handling regulations mandate physical isolation.

Zero External Dependencies

The system operates independently of any external service. No provider outages, no API rate limits, no billing surprises, no policy changes affecting your operations.

Attack Surface Minimisation

Eliminating network connectivity removes the vast majority of attack vectors. No remote exploitation, no C2 callbacks, no data exfiltration over the network.

When Air-Gapping Isn't the Right Call

Air-gapped deployments carry significant operational overhead. They're the right choice for specific threat models — not a default recommendation.

You need frontier model capabilities

If your use case requires Claude, GPT-4, or Gemini-level reasoning, air-gapping isn't viable. Open-source models are excellent but don't match frontier models on complex tasks. Consider on-premise with a selective API approach instead.

Rapid iteration is critical

The secure transfer overhead slows development cycles significantly. If you're in an early prototyping phase, start with a cloud or on-premise deployment and migrate to air-gapped once the system is stable.

You don't have the operational capacity

Air-gapped environments require dedicated infrastructure teams, physical security controls, and rigorous transfer procedures. Without the personnel and processes to support it, the security benefits erode quickly.

Need an air-gapped AI deployment?

I help plan and build isolated AI infrastructure — from model selection and hardware sizing to secure transfer procedures and ongoing operations.

Start a Conversation Compare: On-Premise →