Book a free strategy call — pick a time that works for you Book Now →
NemoClaw on Jetson AGX Thor edge AI agent deployment

NemoClaw on Jetson AGX Thor: Edge AI Agent Deployment Guide

“Getting Started with NVIDIA NemoClaw on Jetson AGX Thor — the factory floor does not have an internet connection. The agent has to work anyway.”

— ajeetraina.com, NemoClaw Jetson deployment guide, 2026

NemoClaw is NVIDIA’s enterprise security wrapper for agentic AI — the sandboxed runtime that adds kernel-level isolation, YAML policy enforcement, and privacy routing to the open-source OpenClaw agent framework. When NemoClaw runs on a Jetson AGX Thor at the network edge, it becomes an autonomous AI agent that operates without any cloud dependency. The inference engine runs locally on the Jetson’s integrated GPU. The sandbox runs locally. The policy engine runs locally. No tokens, no prompts, and no tool outputs ever leave the device.

Edge deployment is not a scaled-down version of cloud deployment. It is a fundamentally different operational model where there is no cloud dependency and all data stays on the device. The Jetson has no fallback — if the local model cannot handle a task, there is no cloud API to escalate to. The device may run in environments with no network connectivity at all: factory floors, mining operations, military installations, retail locations in rural areas. This article covers the technical path from unboxing a Jetson AGX Thor to running a production NemoClaw agent in a fully air-gapped configuration.

NemoClaw Supported Hardware Platforms

NemoClaw is designed to run across NVIDIA’s full hardware range: RTX PCs (consumer and workstation GPUs), DGX Spark (desktop AI computer), DGX Station (workstation-class multi-GPU), and NVIDIA-powered servers (data center and on-premises). Jetson AGX Thor extends this to edge and embedded deployments. The same NemoClaw sandbox, policy engine, and privacy router run on all platforms — the differences are in available compute, model size capacity, and the number of concurrent agents each platform supports.

For the hardware comparison that includes Jetson alongside DGX and RTX options, see our NemoClaw Hardware Guide. For the privacy router configuration that enforces local-only inference, see our NemoClaw Privacy Router deep dive. For information about ManageMyClaw and what we do, visit our About page.

0 cloud API calls — fully air-gapped capability
128 TOPS — Jetson AGX Thor’s Blackwell GPU compute
Section 1 • Problem

Why Edge AI Agents Exist: The Environments That Cannot Use Cloud

Cloud-based AI agents assume reliable, low-latency internet connectivity. This assumption fails in three categories of enterprise environments where NemoClaw edge deployment on Jetson solves a problem no cloud architecture can address.

Manufacturing and industrial. Factory floors in automotive, semiconductor, and heavy manufacturing facilities frequently operate on air-gapped networks for safety and IP protection reasons. SCADA systems, robotic assembly lines, and quality inspection stations are physically isolated from the internet. An AI agent that assists with predictive maintenance, defect classification, or production scheduling must run entirely on local compute.

Retail and field operations. Retail locations in rural areas, mobile field service vehicles, and pop-up operations often have intermittent connectivity. An AI agent that processes inventory, handles customer interactions, or assists field technicians needs to function when the network drops — and it needs to function fully, not in a degraded mode that prompts “please reconnect to continue.”

Defense and classified environments. Military and intelligence operations require systems that cannot, by regulation, connect to public cloud infrastructure. ITAR-controlled environments prohibit data from crossing certain network boundaries. These are not optional security preferences — they are legal requirements with criminal penalties for violations.

Section 2 • Hardware

Jetson AGX Thor: What the Hardware Provides for NemoClaw

The Jetson AGX Thor is NVIDIA’s highest-performance edge computing module. It uses a Blackwell-architecture GPU with 128 TOPS of AI compute, paired with an ARM-based CPU complex with up to 20 cores. The module supports up to 128GB of unified memory shared between CPU and GPU, which eliminates the PCIe bottleneck that limits discrete-GPU systems when swapping model weights between CPU and GPU memory.

Specification Jetson AGX Thor DGX Station (Air-Gapped)
GPU Architecture Blackwell (integrated) Blackwell (discrete, multi-GPU)
AI Compute 128 TOPS (INT8) 2+ PFLOPS (FP8)
Memory Up to 128GB unified Up to 640GB HBM3e
Power Consumption 15–100W (configurable) 1,500–3,000W
Form Factor Module (credit-card sized) Workstation tower
Air-Gapped Support Native — designed for edge Supported with configuration
Price Range ~$2,000–$5,000 (module + carrier board) $50,000+
Best For Single agent, 12B model, edge sites Multi-agent, 120B model, on-premises
DGX Station Also Supports Air-Gapped Configurations

If your edge requirement is driven by air-gap compliance rather than physical space constraints, the DGX Station is also a valid option. DGX Station supports fully air-gapped NemoClaw deployments with the 120B Nemotron model and multi-agent orchestration, including air-gapped configurations for strict regulatory environments such as ITAR, HIPAA, and classified workloads. The Jetson AGX Thor is for environments where physical size, power consumption, and cost are the primary constraints — factory floors, vehicle-mounted systems, retail kiosks.

Section 3 • Setup

Installing NemoClaw on Jetson AGX Thor

The Jetson runs JetPack OS, NVIDIA’s Linux-based operating system built on Ubuntu. NemoClaw’s OpenShell sandbox requires Linux kernel 5.13+ for Landlock support — JetPack 6.x includes kernel 5.15, which satisfies this requirement. The installation process differs from x86 NemoClaw in two ways: the ARM64 binary must be used, and the model quantization must be optimized for unified memory architecture.

Prerequisites

  • Jetson AGX Thor with JetPack 6.1+ installed
  • USB/Ethernet connection to a workstation (for initial setup only — the Jetson will be air-gapped after deployment)
  • Nemotron 3 Super 12B model weights pre-downloaded to a USB drive or network share
  • NemoClaw ARM64 binary and dependencies pre-packaged (offline installation)
  • 32GB+ unified memory configured in JetPack power profile

Step 1: Transfer NemoClaw and Model Weights (Pre-Air-Gap)

Terminal — Offline Package Transfer
# On the workstation: create an offline installation package
$ nemoclaw package create \
    --arch arm64 \
    --model nvidia/nemotron-3-super-12b \
    --quantization int4-awq \
    --include-dependencies \
    --output /mnt/usb/nemoclaw-jetson-package.tar.gz

# Transfer the package to the Jetson via USB
$ scp /mnt/usb/nemoclaw-jetson-package.tar.gz jetson@192.168.55.1:/opt/nemoclaw/

# On the Jetson: extract and install offline
$ cd /opt/nemoclaw
$ tar -xzf nemoclaw-jetson-package.tar.gz
$ sudo ./install.sh --offline --arch arm64

Step 2: Configure for Air-Gapped Operation

nemoclaw-config.yaml — Jetson Air-Gapped Configuration
providers:
  default: vllm-local
  vllm-local:
    endpoint: "http://127.0.0.1:8000/v1"
    model: "nvidia/nemotron-3-super-12b"
    max_tokens: 4096
    timeout_seconds: 180

privacy_router:
  mode: air-gapped
  block_all_external: true
  allowed_endpoints: []
  log_blocked_attempts: true

sandbox:
  mode: deny-by-default
  network:
    allow_loopback: true
    deny_all_external: true

telemetry:
  enabled: false
  phone_home: false
Disable Telemetry for Air-Gapped Deployments

NemoClaw’s default configuration sends anonymized telemetry to NVIDIA. In an air-gapped deployment, this causes silent connection failures and log noise as the telemetry client repeatedly attempts to reach NVIDIA’s servers. Set telemetry.enabled: false and phone_home: false to eliminate these failed connection attempts. If you forget this step, the agent will still function, but your logs will fill with connection timeout errors every 60 seconds.

Step 3: Verify the Air-Gapped Stack

Terminal — Air-Gap Verification
# Disconnect the network cable (or disable Wi-Fi)
$ sudo nmcli networking off

# Start vLLM with local model
$ vllm serve /opt/nemoclaw/models/nemotron-3-super-12b \
    --quantization int4-awq \
    --max-model-len 32768 \
    --gpu-memory-utilization 0.80 \
    --port 8000 \
    --host 127.0.0.1

# Start NemoClaw
$ nemoclaw start --config /opt/nemoclaw/nemoclaw-config.yaml

# Verify completely offline operation
$ nemoclaw status
Agent PID: 3412
Sandbox: ACTIVE (deny-by-default)
Provider: vllm-local (127.0.0.1:8000)
Privacy Router: air-gapped (all external blocked)
Network: OFFLINE (loopback only)
Status: RUNNING

# Test agent functionality without network
$ nemoclaw run --prompt "Analyze the sensor data in /data/sensors/latest.csv"
[sandbox] Reading /data/sensors/latest.csv (2.1MB)
[inference] Local vLLM response (3.2s, 847 tokens)
[result] Analysis complete. Report saved to /data/reports/analysis-2026-03-20.txt
Section 4 • Use Cases

Edge Deployment Patterns: Manufacturing, Retail, Industrial

Manufacturing: Predictive Maintenance Agent

A NemoClaw agent on Jetson AGX Thor monitors vibration sensors, temperature readings, and motor current draw from CNC machines on a factory floor. The agent runs a predictive maintenance model locally, identifying patterns that precede equipment failures. When the agent detects an anomaly, it generates a maintenance work order and pushes it to the local MES (Manufacturing Execution System) through a sandboxed API call to the factory’s internal network. No data leaves the facility.

predictive-maintenance-policy.yaml
agent: "maintenance-predictor"
sandbox:
  mode: deny-by-default
  filesystem:
    read:
      - "/data/sensors/"
      - "/data/equipment-specs/"
    write:
      - "/data/reports/"
      - "/data/work-orders/"
  network:
    allowed_endpoints:
      - "192.168.1.50:8080"  # Local MES system
    deny_all_other: true
  tools:
    allowed:
      - "csv_reader"
      - "anomaly_detector"
      - "work_order_generator"
      - "mes_api_client"

Retail: Inventory and Customer Assist Agent

A Jetson-mounted NemoClaw agent in a retail location processes natural language queries from staff about inventory levels, product specifications, and pricing. The agent reads from a local SQLite database synced nightly when the store’s network is available. During business hours, the agent operates fully offline. Staff ask questions through a local terminal or tablet; the agent responds with inventory counts, product comparisons, and reorder recommendations without any cloud API calls.

Industrial: Safety Compliance Agent

An industrial safety agent on Jetson processes camera feeds and sensor data to verify compliance with safety protocols — PPE detection, restricted zone monitoring, gas level alerts. The agent runs computer vision models locally alongside NemoClaw’s reasoning agent. When a violation is detected, the agent generates an incident report, triggers local alarms, and logs the event to the facility’s safety database. The entire pipeline runs without internet connectivity.

Section 5 • Operations

Operational Challenges: Updates, Monitoring, Model Refresh

Air-gapped deployments eliminate cloud dependency, but they also eliminate cloud convenience. Every operational task that cloud deployments handle automatically — software updates, model updates, log aggregation, monitoring alerts — must be handled through an offline process on edge devices.

Software updates. NemoClaw and vLLM updates must be packaged on a connected workstation, transferred to a USB drive, and installed on the Jetson during a scheduled maintenance window. There is no over-the-air update mechanism for air-gapped Jetsons. Build a quarterly update cadence and version-lock between NemoClaw, vLLM, and the Nemotron model to avoid compatibility issues.

Model refresh. When NVIDIA releases a new Nemotron model version, the update process is the same: download on a connected machine, transfer via USB, test on a staging Jetson, then deploy to production edge devices. Never update the model on a production edge device without testing first — a model that works on x86 may behave differently on ARM64 with different quantization settings.

Log collection. NemoClaw’s audit logs accumulate on the Jetson’s local storage. For compliance, these logs must be periodically collected — either through a scheduled network sync during maintenance windows, or through physical USB retrieval. Set log rotation to prevent storage exhaustion: the Jetson’s onboard storage is limited compared to server-class hardware.

Terminal — Log Rotation for Edge Deployment
# Configure NemoClaw log rotation on Jetson
$ nemoclaw config set logging.max_size_mb 500
$ nemoclaw config set logging.max_files 10
$ nemoclaw config set logging.rotation daily

# Export logs to USB for compliance collection
$ nemoclaw logs export \
    --from 2026-03-01 \
    --to 2026-03-20 \
    --output /mnt/usb/nemoclaw-logs-march-2026.tar.gz

# Verify log integrity
$ nemoclaw logs verify --file /mnt/usb/nemoclaw-logs-march-2026.tar.gz
Log integrity: VALID (SHA-256 checksums match)
Entries: 47,231
Date range: 2026-03-01 to 2026-03-20
Storage Alert: Jetson Has Limited Onboard Storage

The Jetson AGX Thor’s onboard eMMC or NVMe storage is typically 64–256GB. After the OS, NemoClaw, vLLM, and the Nemotron 12B model (approximately 8GB in INT4-AWQ), you may have 40–200GB remaining. Sensor data, logs, and generated reports accumulate. Monitor disk usage with df -h and set NemoClaw alerts at 80% capacity. Add an external NVMe drive via the carrier board’s M.2 slot if your use case generates significant data.

Section 6 • Limitations

What Jetson Edge Deployment Cannot Do

The Jetson AGX Thor is a powerful edge device, but it is not a DGX. The 12B Nemotron model runs well within the Jetson’s compute and memory envelope. The 120B model does not fit. This means edge agents are limited to the capabilities of the smaller model — which is still highly capable for structured tasks but does not match the 120B model’s performance on complex multi-step reasoning, long-form generation, or nuanced language understanding.

Multi-agent orchestration is possible but constrained. Running two or three agents concurrently with the 12B model will consume most of the Jetson’s unified memory. Five or more concurrent agents will require aggressive quantization (INT4 or lower) and reduced context windows, which degrades output quality. If your edge use case requires multi-agent orchestration at scale, the DGX Station in an air-gapped configuration is the appropriate hardware.

There is no fallback. If the local model cannot solve a problem, the agent cannot escalate to a larger cloud-based model. Design edge agent workflows for tasks where the 12B model consistently succeeds. Use the connected workstation during deployment planning to test every expected workflow against the 12B model before deploying to the air-gapped Jetson.

Reference • FAQ

Frequently Asked Questions

Can NemoClaw on Jetson connect to a local network without internet access?

Yes. Air-gapped does not mean network-less. The Jetson can connect to a local area network that has no internet gateway — factory LANs, building management networks, or private VLANs. NemoClaw’s privacy router policy controls which local endpoints the agent can reach. Set allowed_endpoints to include only the specific local IPs and ports the agent needs (MES systems, local databases, SCADA controllers) while keeping deny_all_other: true.

What is the power consumption of NemoClaw inference on Jetson AGX Thor?

The Jetson AGX Thor supports configurable power modes from 15W to 100W. During active NemoClaw inference with the 12B model, expect 60–80W sustained power draw. During idle periods (model loaded but no requests), power drops to 20–30W. This is compatible with industrial UPS systems, vehicle power supplies, and solar installations that cannot support the 1,500W+ draw of a DGX Station. Configure the power mode with nvpmodel based on your deployment’s thermal and power constraints.

How do I handle NemoClaw security patches on air-gapped Jetsons?

Security patches follow the same offline update process as regular updates. Subscribe to NVIDIA’s NemoClaw security advisories. When a patch is released, download it on a connected workstation, verify the checksum against NVIDIA’s published hash, transfer to USB, and install on the Jetson during a maintenance window. For critical vulnerabilities, expedite the update cycle. For air-gapped fleets with many Jetsons, build an internal patch staging server that Jetsons connect to during scheduled maintenance windows on the isolated factory LAN.

Planning an Edge NemoClaw Deployment? Our Enterprise team helps design air-gapped agent architectures, Jetson fleet management procedures, and offline update workflows for manufacturing, retail, and industrial environments. Learn About Our Services