The Architecture
A multi-node infrastructure running local LLMs, semantic memory, agent orchestration, and home automation. Everything here is in active use—not experiments.
Systems
Tools and infrastructure I design, build, and operate daily.
Cortex
Self-Healing Memory Platform for AI Agents
Persistent, searchable, cross-session memory for AI agents. Dual-path search — Postgres full-text and ChromaDB semantic vectors — merged and ranked with a recency bonus. A-MAC admission control quality-gates every write. An auto-linker builds a relationship graph without manual curation. Adaptive retrieval (Agentic RAG via LangGraph) reformulates queries when relevance is low. Runs natively on Apple Silicon with embedded ChromaDB — no Docker, no zombie containers, self-healing at the process, application, and infrastructure tiers. Three-node architecture with warm standby.
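The merge-and-rank step can be sketched in a few lines. The half-life, bonus weight, and `Hit` shape here are illustrative, not Cortex's actual tuning:

```python
from dataclasses import dataclass
from datetime import datetime, timezone

@dataclass
class Hit:
    memory_id: str
    score: float          # backend-native relevance, normalized to 0..1
    created_at: datetime

def merge_ranked(fts_hits, vector_hits, half_life_days=30.0, now=None):
    """Merge full-text and semantic hits: keep the best score per memory,
    then rank with an exponentially decaying recency bonus."""
    now = now or datetime.now(timezone.utc)
    best = {}
    for hit in [*fts_hits, *vector_hits]:
        if hit.memory_id not in best or hit.score > best[hit.memory_id].score:
            best[hit.memory_id] = hit

    def ranked_score(hit):
        age_days = (now - hit.created_at).total_seconds() / 86400
        recency_bonus = 0.5 ** (age_days / half_life_days)  # halves every half-life
        return hit.score + 0.2 * recency_bonus

    return sorted(best.values(), key=ranked_score, reverse=True)
```

A fresh-but-weaker hit can outrank a stale-but-stronger one, which is the point: recent context usually matters more in a coding session.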
Forge
Queue-Based Autonomous Worker System
Event-driven worker system for orchestrating AI tasks on local hardware. Redis-backed queues coordinate GPU access, LoRA adapter training, health monitoring, and overnight chain orchestration — all running autonomously via macOS LaunchAgents. Nightly chain runs security review, code review, PR digest, news curation, arXiv scanning, and morning briefings. Self-healing daemon scans every 60 seconds and kickstarts crashed services. Sandbox builder generates complete iOS/macOS apps autonomously — Ollama writes Swift, xcodegen builds projects, xcodebuild compiles, and an error loop self-corrects failed builds.
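The core worker loop is simple once the queue protocol is fixed. This is a minimal sketch with the Redis call abstracted behind a `pop_task` callable (in production that would wrap a blocking pop on the task queue); the task schema shown is illustrative:

```python
import json

def run_worker(pop_task, handlers, max_tasks=None):
    """Drain tasks until the queue is empty or max_tasks is reached.

    pop_task  -- returns the next JSON task payload, or None when empty;
                 in production this would wrap redis.blpop on the queue.
    handlers  -- maps task type -> callable taking the task's payload dict.
    """
    done, failed = [], []
    while max_tasks is None or len(done) + len(failed) < max_tasks:
        raw = pop_task()
        if raw is None:
            break
        task = json.loads(raw)
        handler = handlers.get(task["type"])
        try:
            if handler is None:
                raise KeyError(f"no handler for {task['type']}")
            handler(task.get("payload", {}))
            done.append(task["type"])
        except Exception:
            failed.append(task["type"])  # a failed task never kills the worker
    return done, failed
```

Keeping failures inside the loop is what lets a LaunchAgent-supervised worker survive a bad overnight task and keep draining the chain.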
Thalamus
MQTT Gateway to AI Memory
Bridges IoT and Home Assistant events into Cortex via MQTT. Sensor readings, automation triggers, and system heartbeats flow through the bus into persistent memory where agents can query them.
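The bridge's translation step is pure: an MQTT message in, a memory record out. A sketch, with an assumed topic scheme (`homeassistant/<domain>/<entity>/state`) and an illustrative record shape — not Thalamus's actual wire format:

```python
import json

def mqtt_to_memory(topic, payload, received_at):
    """Translate an MQTT state message into a Cortex-style memory record.

    Assumed topic scheme: homeassistant/<domain>/<entity>/state.
    Returns None for traffic that should not be persisted.
    """
    parts = topic.split("/")
    if len(parts) < 4 or parts[-1] != "state":
        return None  # heartbeats and non-state traffic are handled elsewhere
    domain, entity = parts[1], parts[2]
    return {
        "kind": "sensor_reading",
        "source": f"{domain}.{entity}",
        "content": json.loads(payload),
        "observed_at": received_at,
    }
```

In production this would sit inside a paho-mqtt `on_message` callback, with the returned record written to Cortex.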
Nerve
MCP Server Registry & Guardian
Maintains a canonical registry of MCP server configurations. Detects config drift, verifies connectivity, and auto-restores missing servers. The structural safeguard that keeps the multi-agent ecosystem intact.
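Drift detection reduces to a three-way diff between the canonical registry and a live client config. A minimal sketch (the config shapes are illustrative):

```python
def detect_drift(canonical, live):
    """Diff the canonical MCP server registry against a live client config.

    Both arguments map server name -> config dict (command, args, env, ...).
    """
    missing = sorted(set(canonical) - set(live))    # candidates for auto-restore
    unknown = sorted(set(live) - set(canonical))    # servers not in the registry
    changed = sorted(
        name for name in set(canonical) & set(live)
        if canonical[name] != live[name]            # command/args/env drifted
    )
    return {"missing": missing, "unknown": unknown, "changed": changed}
```

Auto-restore then just writes the canonical entry back for everything in `missing` and `changed`, after a connectivity check.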
Forge Command Center
Real-Time System Monitoring
Live worker status, queue depths, chain execution history, and system health. React frontend backed by the Forge API with auto-refresh and task dispatch controls.
Hook Pipeline
5-Stage Auto-Persist System
Five hooks fire during every Claude Code session: context recall, pre-tool injection, Slack commit alerts, pre-compaction memory extraction, and session-end persistence. Ollama extracts structured memories from transcripts and stores them in Cortex automatically.
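The session-end extraction step looks roughly like this. The prompt wording and memory shape are illustrative, and the model call is injected as a plain callable so the hook logic stays testable — in production it would wrap a local Ollama generate call:

```python
import json

EXTRACT_PROMPT = (
    "Extract durable facts, decisions, and preferences from this session "
    "transcript as a JSON list of {\"text\": ..., \"tags\": [...]} objects.\n\n"
)

def extract_memories(transcript, generate):
    """Run the transcript through a local model and parse structured memories.

    generate -- any text-completion callable (production: a local Ollama call).
    Returns [] on malformed model output rather than crashing the hook.
    """
    raw = generate(EXTRACT_PROMPT + transcript)
    try:
        memories = json.loads(raw)
    except json.JSONDecodeError:
        return []
    return [m for m in memories if isinstance(m, dict) and m.get("text")]
```

Failing closed matters here: a hook that throws at session end would block persistence for the whole session, so bad model output degrades to "nothing saved" instead.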
Local LLM Fleet
24 Models on M4 Metal
24 models served via Ollama on Apple Silicon (M4). Three-model strategy: gemma4:e4b (27 tok/s, native function calling, voice assistant), qwen3.5:9b (deep reasoning, code generation, overnight workers), phi4-mini:3.8b (fast triage, 30 tok/s). Five custom LoRA adapters. Powers everything from Home Assistant voice control to autonomous code generation — zero cloud API calls.
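The three-model strategy is just a routing table. A sketch using the tiers above (the task-kind labels are illustrative):

```python
ROUTES = {
    "voice": "gemma4:e4b",       # native function calling, ~27 tok/s
    "reasoning": "qwen3.5:9b",   # deep reasoning, code gen, overnight workers
    "triage": "phi4-mini:3.8b",  # fast classification, ~30 tok/s
}

def pick_model(task_kind, fallback="phi4-mini:3.8b"):
    """Route a task to its model tier; unknown kinds fall back to fast triage."""
    return ROUTES.get(task_kind, fallback)
```

Defaulting unknown work to the smallest model keeps latency bounded; anything that genuinely needs depth gets routed explicitly.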
System Architecture
The homelab maps to brain structures—each system mirrors a cognitive function.
Philosophy
Everything here exists because I use it. Cortex runs in every coding session. Forge orchestrates overnight chains while I sleep. The homelab hosts real services my household depends on.
Most “AI-powered” features are demos pretending to be products. This is where I test what actually works under real constraints: latency, failure modes, cost, and whether anyone would use it twice.
I build infrastructure so I can recommend it honestly. Every system I suggest to a client is one I've operated myself.
Interested in this kind of infrastructure?
I build these systems for clients too. Let's talk about what you need.
Start a Conversation