Neural Spine

The N is not a letter — it is an infinity, folded.

The cyan triangle is the fold — the point where infinite becomes operable.

AI-Inference First. The Neural Spine wires the LLM to your precision instruments, workflows, documents, and personas on a single nerve. Every feature answers one question: in the next iteration, how many human decision points does this remove?

The architecture is the product. Chat Engine routes tools. Workflow Engine executes DAGs. Event Bus chains them. 5 circuit breakers hold the line when any subsystem stalls.

Chat Engine
Workflow DAG
Event Bus
BullMQ + Redis
LibSQL · per-user
MCP custom servers

04.0 Numbers · What you can verify

Field detection → manager LINE

Instrument protocols supported

Sovereign · your cloud

0engines

YOLO + Vision LLM cross-validate

04.1 How it works · Surface · Spine · Memory

Three layers, one nerve.

A user types in chat or a sensor pushes a frame. The spine routes, executes, and audits. Memory persists what survives the gate. That is the whole product.

SURFACE

Where your team types or your sensor pushes. Same chat for sourcing audits and field disease scans. Same API for any precision instrument.

SPINE

Where decisions get made. Workflow Engine runs your DAG, Event Bus chains workflows, five circuit breakers keep one stalled subsystem from wedging the rest.

MEMORY

Three-gate Gatekeeper validates before anything persists. Below threshold goes to human review. Per-user namespace, AES-256-GCM at rest. Your data never crosses tenants.

04.2 Circuit breakers · 5 lanes, no exceptions

No subsystem is unbreakable.

Every external dependency lives inside a breaker. Open → fast-fail. Recovered → half-open probe. A subsystem without a breaker doesn't earn its way to production.

01 llm

LLM

Provider failover. Multi-pod safe. Backs off + recovers automatically.

02 mcp

MCP

Per-server isolation. One bad MCP server cannot starve the others.

03 event bus

Event Bus

Wildcard subscriber storm protection. Slow listener never wedges the hot path.

04 audit

Audit

Persist queue + guarded writes. Causal chain never lost on DB stall.

05 quota

Quota

Per-workspace caps. One workspace cannot exhaust shared LLM budget.

04.3 Memory Gatekeeper · Three-gate confidence scoring

What gets remembered, you approve.

Memory persistence is the slowest possible drift. We refuse to write anything until three gates pass. Below threshold goes to Gatekeeper queue for human approval. Memory you can trust.

Extraction gate

Is there an extractable claim? Filters small-talk, noise, transient state.

Classification gate

Preference / Fact / Episode / Relationship — refuses if type is unclear.

Confidence gate

Score against existing memory + conflict detection. Below threshold → human review.

→

Persisted

All three gates passed. Memory enters long-term store, scoped by workspace + user. Conflict detection alerts on contradictions.

04.4 Bring your own model · Configuration-time choice

Pick the model that fits your audit posture.

A curated default ships out of the box. Swap in any OpenAI-compatible endpoint at configuration time — your model, your endpoint, your audit log. Provider neutrality is the rule; the supply-chain detail lives on the Security page.

OpenAI Built-in

Anthropic Built-in

Claude Built-in

Groq Built-in

Google Gemini Built-in

OpenRouter Built-in

Ollama Built-in · self-hosted

Custom (BYO) Customer endpoint · runtime

NEUTRAL UX

Custom provider surfaces in the UI as Custom. No origin is celebrated or suppressed at the surface.

AUDITABLE

Every LLM call carries a provider tag into the audit log. Customers can verify which endpoint actually answered.

SOVEREIGN

Inference stays inside the deployment. No third-party hop, no data egress, no vendor lock-in.

Architect the spine. Personas come for free.

A platform is what it can refuse to break. We architect for the bad day, not the demo day.

Talk to us → Security & Origin Policy