System Goals
- Local-first inference and control — no external API dependency for core operation
- Deterministic execution pathways for critical actions (HID injection, file writes)
- Service composition with explicit trust boundaries
- Operator authority at all layers — OtherU acts, humans approve or override
Major Planes
1) Perception
Captures relevant state from the environment:
- JetKVM HDMI capture — real-time desktop screenshots via hardware bridge
- Vision resize pipeline — raw capture letterboxed to model input resolution for Fara
- Optical flow cursor tracking — Lucas-Kanade tracker, confidence-scored position
- RAG memory retrieval — LEANN (HNSW vector index) + LightRAG + GraphRAG for context
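The letterbox step in the resize pipeline can be sketched as follows. This is a minimal sketch, not the pipeline's actual code: it assumes a uniform scale with centered padding and uses a 1024×1024 target purely for illustration, since the real Fara input resolution is not stated here. `Letterbox` and `letterbox_params` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Letterbox:
    scale: float  # uniform scale applied to the raw capture
    pad_x: int    # left padding, in model-input pixels
    pad_y: int    # top padding, in model-input pixels
    new_w: int    # width of the scaled capture inside the canvas
    new_h: int    # height of the scaled capture inside the canvas


def letterbox_params(src_w: int, src_h: int, dst_w: int, dst_h: int) -> Letterbox:
    """Compute the scale and centered padding that fit src inside dst."""
    # The smaller ratio guarantees the whole capture fits without cropping.
    scale = min(dst_w / src_w, dst_h / src_h)
    new_w = round(src_w * scale)
    new_h = round(src_h * scale)
    # Leftover space is split evenly; any odd pixel goes to the far side.
    pad_x = (dst_w - new_w) // 2
    pad_y = (dst_h - new_h) // 2
    return Letterbox(scale, pad_x, pad_y, new_w, new_h)


# Illustrative sizes only: a 1920x1080 capture into a square canvas.
lb = letterbox_params(1920, 1080, 1024, 1024)
print(lb)
```

Keeping the scale and padding in one record makes it straightforward to undo the transform later, when model coordinates have to be mapped back to the real screen.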
2) Reasoning
Builds plans from user intent and system state using a four-tier model selection system:
| Tier | Purpose | When used |
|---|---|---|
| FAST | Greetings, acks | Short, low-complexity prompts |
| DEFAULT | Conversation + memory retrieval | General queries (uses 80B MoE responder) |
| REASONING | Deep analysis | Complex multi-step reasoning (uses 14B R1 model) |
| COMPLEX | Tool execution loop | Any action requiring tool calls (Fara, coder, web, etc.) |
Routing path:
User Message
│
▼
OpenClaw (Telegram / Signal / WhatsApp)
│
▼
Coordinator (OpenAI-compatible API, port 8080)
│
├── preroute_by_pattern() ← rule-based fast-path
│   ├── screenshot request → execute directly
│   ├── desktop/KVM task → execute_desktop_tool()
│   ├── record request → execute directly
│   └── code / web / etc → route to specialist agent
│
├── openclaw_tool_decision() ← 8B orchestrator selects external tools
│
└── council_streaming()
    ├── needs_tools() → COMPLEX (orchestrator loop)
    │   └── orchestrator selects tool → execute_tool_call()
    └── otherwise → DEFAULT / REASONING / FAST
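The routing path above can be approximated in a few lines. The names `preroute_by_pattern` and `needs_tools` come from the diagram, but their bodies here are hypothetical: the regular expressions, the three-word threshold for FAST, and the helper `select_tier` are assumptions made for illustration, not the coordinator's real rules.

```python
import re
from enum import Enum
from typing import Optional


class Tier(Enum):
    FAST = "fast"
    DEFAULT = "default"
    REASONING = "reasoning"
    COMPLEX = "complex"


# Hypothetical patterns; the real rule set is not shown in this document.
PREROUTE_RULES = [
    (re.compile(r"\bscreenshot\b", re.I), "screenshot"),
    (re.compile(r"\b(click|type|desktop|kvm)\b", re.I), "desktop"),
    (re.compile(r"\brecord\b", re.I), "record"),
]


def preroute_by_pattern(message: str) -> Optional[str]:
    """Rule-based fast path: return a direct target or None to fall through."""
    for pattern, target in PREROUTE_RULES:
        if pattern.search(message):
            return target
    return None


def needs_tools(message: str) -> bool:
    """Crude keyword stand-in for the tool-need check (assumption)."""
    return bool(re.search(r"\b(run|search|open|fetch|write)\b", message, re.I))


def select_tier(message: str) -> Tier:
    """Pick a tier for messages that were not handled by the fast path."""
    if needs_tools(message):
        return Tier.COMPLEX
    if len(message.split()) <= 3:  # assumed cutoff for greetings and acks
        return Tier.FAST
    if re.search(r"\b(why|prove|compare|analy[sz]e|step by step)\b", message, re.I):
        return Tier.REASONING
    return Tier.DEFAULT
```

Checking the fast path first means simple requests such as screenshots never load a model, while everything else falls through to tier selection.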
3) Action
Executes constrained operations through typed adapters:
- Desktop/HID actions — JetKVM WebRTC bridge → mouse move/click, keyboard type/press
- Coordinate translation — vision model coordinates inverse-transformed to real screen coordinates before HID injection
- Code execution — sandboxed shell commands via coordinator exec tool
- File operations — workspace read/write via local filesystem tool
- Messaging — post to Telegram topics, send alerts, create summaries
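The coordinate translation step can be sketched as the inverse of a letterbox resize. This sketch assumes the forward step was a uniform scale with centered padding, and that clamping to the screen bounds is acceptable when the model points into the padding; both are assumptions, and `model_to_screen` is a hypothetical helper name.

```python
from typing import Tuple


def model_to_screen(
    mx: float, my: float, src_w: int, src_h: int, dst_w: int, dst_h: int
) -> Tuple[int, int]:
    """Map a point in model-input space back to real screen pixels.

    src_* is the real screen size, dst_* is the model input size.
    """
    # Recompute the forward letterbox parameters.
    scale = min(dst_w / src_w, dst_h / src_h)
    pad_x = (dst_w - round(src_w * scale)) // 2
    pad_y = (dst_h - round(src_h * scale)) // 2

    # Invert: remove the padding offset, then undo the scale.
    sx = round((mx - pad_x) / scale)
    sy = round((my - pad_y) / scale)

    # Clamp so a point inside the padding never leaves the screen.
    sx = min(max(sx, 0), src_w - 1)
    sy = min(max(sy, 0), src_h - 1)
    return sx, sy


# Illustrative sizes only: the center of a 1024x1024 model canvas
# maps to the center of a 1920x1080 screen.
print(model_to_screen(512, 512, 1920, 1080, 1024, 1024))
```

Clamping is a conservative choice for HID injection: an out-of-range coordinate is pulled to the nearest screen edge instead of being sent to the bridge as-is.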
4) Memory
Stores context for continuity and retrieval:
- LEANN — HNSW vector index using `facebook/contriever` embeddings
- LightRAG — Graph + vector hybrid retrieval, integrated in coordinator
- GraphRAG — Custom knowledge graph implementation
- Redis — Operational state: routing session pins, model memory tracking, GSD loop state
- Daily log files — raw session notes for rolling context
- Long-term memory file — curated knowledge, updated periodically
Reliability Model
- Health and dependency endpoints on every service (`/health`, `/dependencies/health`)
- All containers configured with `--restart unless-stopped`
- Routing policy with session pinning (configurable TTL) for conversational coherence
- Cascade routing — primary path failure falls back through candidate tiers
- Coordinator files bind-mounted from host — edit and restart to apply changes
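Session pinning and cascade routing can be illustrated together. The sketch below uses an in-memory dictionary as a stand-in for the Redis-backed pins and treats any exception from a handler as a path failure; the class `SessionPins`, the function `route_with_cascade`, and the handler signature are all hypothetical.

```python
import time
from typing import Callable, Dict, List, Optional, Tuple


class SessionPins:
    """In-memory stand-in for Redis-backed session pins with a TTL."""

    def __init__(self, ttl_seconds: float, clock: Callable[[], float] = time.monotonic):
        self._ttl = ttl_seconds
        self._clock = clock
        self._pins: Dict[str, Tuple[str, float]] = {}

    def get(self, session_id: str) -> Optional[str]:
        entry = self._pins.get(session_id)
        if entry is None:
            return None
        tier, expires_at = entry
        if self._clock() >= expires_at:
            del self._pins[session_id]  # expired pins are dropped lazily
            return None
        return tier

    def set(self, session_id: str, tier: str) -> None:
        self._pins[session_id] = (tier, self._clock() + self._ttl)


def route_with_cascade(
    session_id: str,
    candidates: List[str],
    handlers: Dict[str, Callable[[str], str]],
    pins: SessionPins,
) -> Tuple[str, str]:
    """Try the pinned tier first, then the remaining candidates in order."""
    pinned = pins.get(session_id)
    order = ([pinned] if pinned in candidates else []) + [
        c for c in candidates if c != pinned
    ]
    failed = []
    for tier in order:
        try:
            result = handlers[tier](session_id)
        except Exception:
            failed.append(tier)  # fall through to the next candidate
            continue
        pins.set(session_id, tier)  # pin the tier that actually answered
        return tier, result
    raise RuntimeError(f"all candidate tiers failed: {failed}")
```

Pinning the tier that succeeded, and not the one that was requested, keeps follow-up messages on a working path until the TTL expires.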
WMMA Ops Integration (AMD gfx1151)
OtherU Core integrates the wmma_ops kernel stack for AMD gfx1151 (RDNA3.5 / Strix Halo):
- Architecture target: gfx1151 with ROCm WMMA paths
- Achieved throughput: 21.6 TFLOPS on AMD Radeon 8060S iGPU
- Kernel variants: `matmul_adaptive`, `matmul_zerocopy`, `matmul_asmOpt`, selected by workload shape
- Critical ROCm flags: `-fa 1` (flash attention), `--no-mmap`, `ROCBLAS_USE_HIPBLASLT=1`
- Result: +40% prefill improvement over baseline ROCm configuration
Container Stack
| Container | Port | Purpose |
|---|---|---|
| otheru-coordinator | 8080 | Main FastAPI orchestrator (OpenAI-compatible) |
| otheru-orchestrator | 8010 | Nemotron-8B tool router |
| agent-responder | 8016 | 80B MoE model — user-facing responses |
| agent-reasoner | 8017 | 14B reasoning model — deep analysis |
| agent-coder | 8011 | Coding specialist |
| agent-fara | 8015 | Fara-7B vision model (loaded on demand) |
| otheru-leann | 9621 | RAG memory service |
| otheru-redis | 6379 | State management |
| hardware-bridge | 8005 | JetKVM WebRTC bridge |
| otheru-gsd | 8090 | Autonomous development loop |
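As a rough illustration of how the table maps onto the `/health` endpoints, the sketch below polls each HTTP service once. It assumes the services are reachable on `127.0.0.1`, that a healthy service answers `/health` with status 200, and that Redis is checked separately because it does not speak HTTP; `http_ok` and `check_all` are hypothetical helpers.

```python
import http.client
import urllib.request
from typing import Callable, Dict

# HTTP services and ports copied from the container table.
# Redis (6379) is left out: it uses its own protocol, not HTTP (assumption).
SERVICES: Dict[str, int] = {
    "otheru-coordinator": 8080,
    "otheru-orchestrator": 8010,
    "agent-responder": 8016,
    "agent-reasoner": 8017,
    "agent-coder": 8011,
    "agent-fara": 8015,
    "otheru-leann": 9621,
    "hardware-bridge": 8005,
    "otheru-gsd": 8090,
}


def http_ok(url: str, timeout: float = 2.0) -> bool:
    """Return True if the URL answers with HTTP 200 within the timeout."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status == 200
    except (OSError, http.client.HTTPException):
        # Covers refused connections, timeouts, and HTTP error statuses.
        return False


def check_all(
    services: Dict[str, int],
    probe: Callable[[str], bool] = http_ok,
    host: str = "127.0.0.1",
) -> Dict[str, bool]:
    """Probe /health on every service and report name -> healthy."""
    return {
        name: probe(f"http://{host}:{port}/health")
        for name, port in services.items()
    }


if __name__ == "__main__":
    for name, healthy in check_all(SERVICES).items():
        print(f"{name:22s} {'up' if healthy else 'DOWN'}")
```

Passing the probe as a parameter keeps the check testable without a running stack, and a service that loads on demand (such as `agent-fara`) may legitimately report as down.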