AgentOS has three core concepts that determine what it can do and how it connects to a target machine. While the concepts below involve OS-level services, kernel drivers, and hardware interfaces, you don’t need to understand these internals to use AgentOS. The installer handles the complexity — non-technical users can set up AgentOS through a guided UI without touching configuration files or the command line. AgentOS is platform independent — it supports Windows, macOS, Linux, Android, and iOS Simulators (experimental), so the same agent code works across operating systems.Documentation Index
Fetch the complete documentation index at: https://docs.askui.com/llms.txt
Use this file to discover all available pages before exploring further.
Control Modes
Control modes define how AgentOS connects to the target machine. There are two:- Host Mode — AgentOS runs directly on the target as software. It uses OS-level APIs for screenshots, input, and window management. This is the most common setup.
- Companion Mode — AgentOS runs on a separate device and controls the target through hardware (USB, Bluetooth, HDMI capture) or device bridges (ADB, IDB). No software installation on the target is needed.
Runtime Modes
Runtime modes define how AgentOS runs on the machine it’s installed on. There are two:- Standalone — AgentOS runs as a regular process in the user’s session. Install via
pip install askui-agent-os. Best for local development and testing. - OS Service — AgentOS runs as a Windows system service with SYSTEM privileges. Install via the Service. Best for CI/CD, headless VMs, and enterprise deployments.
Capabilities
Capabilities are the building blocks your agents use — screenshots, keyboard input, mouse control, window management, process management, and more. All capabilities are optimized for token cost efficiency, low latency, and cross-device communication. What’s available depends on the combination of control mode and runtime mode:| Host Mode (Standalone) | Host Mode (OS Service) | Companion Mode | |
|---|---|---|---|
| Screenshots | Yes | Yes | Yes (HDMI capture) |
| Keyboard & mouse | Yes | Yes | Yes (USB/Bluetooth HID) |
| Window & process management | Yes | Yes | — |
| RDP resilience | — | Yes | — |
| Logon screen & CTRL+ALT+DEL | — | Yes | — |
| Mobile devices (ADB/IDB) | — | — | Yes |
Administrator Rights (Windows)
Applies to Host Mode only. In Companion Mode, AgentOS runs on a separate device and reaches the target through hardware bridges (USB, Bluetooth, HDMI capture) or device protocols (ADB, IDB), so Windows privileges on the target are not relevant.
Session 0 vs Interactive Session: Windows runs services in Session 0 — an isolated, non-interactive session with no desktop. Users log in to Interactive Sessions (Session 1, 2, …) where the desktop and GUI applications live. The AgentOS OS Service runs in Session 0 but attaches to the user’s interactive session to control the desktop. This is why it can keep automating even when no user is logged in or an RDP session disconnects.
Standalone
| Component | Default | Elevated |
|---|---|---|
| AgentOS | Not elevated (recommended) | Run with admin rights |
| Remote Device Controller | Not elevated (recommended) | Run with admin rights |
OS Service
| Component | Default | Configurable |
|---|---|---|
| AgentOS | Elevated (SYSTEM) | — |
| Remote Device Controller | Not elevated (recommended) | SERVICE_EXECUTION_ENGINE_ELEVATED=1 |
How They Fit Together
Think of it as two independent choices:- Runtime mode — How does the AgentOS binary start, and with which privileges? Started by an external program (Standalone, user privileges) or registered as a system service (OS Service, SYSTEM privileges).
- Control mode — What does it control once running? The local host (Host) or a separate device through a protocol (Companion).
Control Modes
Host Mode vs Companion Mode.
Runtime Modes
Standalone vs OS Service.
Capabilities
Full capability matrix.