The NeuroNetwork · Architecture

How it actually works

Five coordinated components. One persistent brain. A custom model that lives in your browser. Here is what happens between “tell us what you're making” and your published draft.


The Family

Five components.
One studio.

Most AI products are a chat box in front of someone else's model. RetroHub is the NeuroNetwork — five named layers, each doing the job it's best at, all working together.

The orchestrator · core component

NeuroCORE

Decides what runs where

Every chat turn, every image request, every voice call flows through NeuroCORE. It enforces the workflow stage (Create / Ship / Refine), picks the right provider, applies safety filters, and emits the brand label you see in the chat header. One chokepoint, every decision auditable.

Your persistent brain

NeuroCortex

Remembers everything you make

A typed, versioned knowledge graph. Voice nodes, style nodes, character bibles, project context, library items — all of it persistent, scoped to you, and pinned to specific versions so a system update never silently rewrites your in-flight work.
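To make "pinned to specific versions" concrete, here is a minimal sketch of one way version pinning can work. It is an illustration only, written in Python for brevity: the `Node`, `Cortex`, and `Session` classes and their methods are hypothetical names, not the actual NeuroCortex schema or API. The idea it shows is that updates append a new version rather than overwrite, and a session keeps reading the version it pinned.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Node:
    """One immutable version of a knowledge-graph node (hypothetical shape)."""
    node_id: str
    node_type: str   # e.g. "VOICE", "STYLE", "BOOK_CHARACTER"
    version: int
    content: str


@dataclass
class Cortex:
    """Append-only store: an update adds a new version, never overwrites one."""
    _versions: dict[str, list[Node]] = field(default_factory=dict)

    def put(self, node_id: str, node_type: str, content: str) -> Node:
        history = self._versions.setdefault(node_id, [])
        node = Node(node_id, node_type, len(history) + 1, content)
        history.append(node)
        return node

    def latest(self, node_id: str) -> Node:
        return self._versions[node_id][-1]

    def get(self, node_id: str, version: int) -> Node:
        return self._versions[node_id][version - 1]


@dataclass
class Session:
    """A session records the versions it started with and keeps reading them."""
    cortex: Cortex
    pins: dict[str, int] = field(default_factory=dict)

    def pin(self, node_id: str) -> None:
        self.pins[node_id] = self.cortex.latest(node_id).version

    def read(self, node_id: str) -> Node:
        return self.cortex.get(node_id, self.pins[node_id])


# Placeholder data: a session pins the voice, then the voice is updated.
cortex = Cortex()
cortex.put("voice", "VOICE", "warm, plain-spoken")
session = Session(cortex)
session.pin("voice")
cortex.put("voice", "VOICE", "warm, plain-spoken, a little wry")
```

In this sketch the in-flight session still reads version 1 of the voice node, while any new session that pins afterwards would pick up version 2.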

The inference layer

NeuroEngine

Runs the models — local + cloud

Quill (our custom 135M-parameter model) runs in your browser via WebGPU. Cloud frontier models (Grok, Gemini, Claude) handle Ship & Refine. NeuroEngine is the substrate that abstracts both behind one streaming interface.
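As a rough illustration of what "one streaming interface" over local and cloud models can look like, the sketch below defines a single `stream` method that both kinds of backend implement. Everything here is an assumption made for the example: the `Backend` protocol, the two stand-in classes, and `run_turn` are hypothetical, and the stand-ins echo text rather than run any model. Python is used for brevity even though the real local model runs in the browser.

```python
from typing import Iterator, Protocol


class Backend(Protocol):
    """Anything that can stream text chunks for a prompt."""
    name: str

    def stream(self, prompt: str) -> Iterator[str]: ...


class LocalBackend:
    """Stand-in for an in-browser model; echoes words instead of running one."""
    name = "quill-local"

    def stream(self, prompt: str) -> Iterator[str]:
        for word in prompt.split():
            yield word + " "


class CloudBackend:
    """Stand-in for a hosted model; a real one would make a network call."""

    def __init__(self, provider: str) -> None:
        self.name = f"cloud:{provider}"

    def stream(self, prompt: str) -> Iterator[str]:
        yield f"[{self.name}] "
        yield prompt.upper()


def run_turn(backend: Backend, prompt: str) -> str:
    """Caller code is identical whichever backend is plugged in."""
    return "".join(backend.stream(prompt))
```

The design point is that the chat surface only ever consumes an iterator of chunks, so swapping the local model for a cloud one changes which object is passed in and nothing else.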

The playbook library

NeuroSkills

Knows what 'great work' looks like

Each skill (Children's Book, Cold Email, SEO Cluster, Resume) is a 1,000 to 1,500-word playbook stored as a NeuroCortex node. At every turn, NeuroSkills selects the relevant playbook plus your in-scope voice and project context, then assembles the system prompt.

The studio surface

NeuroUI

Where you actually work

The Production Shell — chat-first, with collapsible Brief / Preview / Insights / Personality panels that fade in and out around the conversation. The Brain Window lets you edit NeuroCortex directly. The slash rail surfaces every skill, app, and shortcut.

Each name maps to a real architectural chokepoint in the codebase. The brand is the architecture.

A Session

What actually happens when you sit down to write

Five steps. Five components. One coordinated flow.

NeuroUI

You tell NeuroUI what you're making.

A blog post about open-source. A children's book chapter where Maple meets the river. A cold outreach email. You type it into the chat input. NeuroUI captures the intent and hands the turn off.

NeuroCORE

NeuroCORE picks the path.

The dispatcher reads your tier, your workflow stage (Create / Ship / Refine), and the task hint. It decides whether this turn goes to the local model, a cloud model, or an image pipeline — and which one. The decision is logged and auditable.
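The routing decision described above can be sketched as a single function that takes the tier, stage, and task hint and returns a logged route. This is a hedged illustration, not the real dispatcher: the `Turn` and `Route` types, the `"free"`/`"paid"` tier names, the `"text"`/`"image"` hints, and the fallback rule for unpaid Ship turns are all assumptions made for the example.

```python
from dataclasses import dataclass
from enum import Enum


class Stage(Enum):
    CREATE = "create"
    SHIP = "ship"
    REFINE = "refine"


@dataclass(frozen=True)
class Turn:
    tier: str                         # hypothetical values: "free" or "paid"
    stage: Stage
    task_hint: str                    # hypothetical values: "text" or "image"
    preferred_cloud: str = "claude"   # user's pick among grok / gemini / claude


@dataclass(frozen=True)
class Route:
    target: str     # "local", "cloud", or "image"
    provider: str
    reason: str


# Every decision is appended here so it can be inspected later.
audit_log: list[Route] = []


def dispatch(turn: Turn) -> Route:
    """Pick a route for one turn and record the decision."""
    if turn.task_hint == "image":
        route = Route("image", "comfyui", "image request")
    elif turn.stage is Stage.CREATE:
        route = Route("local", "quill", "drafting runs on the local model")
    elif turn.tier == "paid":
        route = Route("cloud", turn.preferred_cloud,
                      f"{turn.stage.value} on a paid tier")
    else:
        # Assumption for this sketch only: without a paid tier,
        # Ship / Refine falls back to the local model.
        route = Route("local", "quill", "no cloud budget on this tier")
    audit_log.append(route)
    return route
```

Keeping the whole decision in one pure function plus an append-only log is what makes each choice easy to audit after the fact.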

NeuroSkills

NeuroSkills assembles the working set.

The relevant skill playbook loads (Children's Book, Cold Email, or whichever fits the task). NeuroSkills walks your NeuroCortex, pulls your in-scope voice, project, and character nodes, and assembles a curated system prompt that is version-pinned and deduplicated.
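A minimal sketch of that assembly step, under stated assumptions: the `ContextNode` type, the `pins` mapping from node id to pinned version, and the bracketed section format are all hypothetical, and the node contents are placeholder data. The sketch keeps only nodes that match their pinned version and skips anything already included.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ContextNode:
    node_id: str
    node_type: str    # e.g. "VOICE", "PROJECT", "BOOK_CHARACTER"
    version: int
    text: str


def assemble_prompt(playbook: str, candidates: list[ContextNode],
                    pins: dict[str, int]) -> str:
    """Build a system prompt from a playbook plus pinned, deduplicated nodes."""
    sections = [playbook.strip()]
    seen_ids: set[str] = set()
    seen_text: set[str] = set()
    for node in candidates:
        if pins.get(node.node_id) != node.version:
            continue              # not in scope, or not the pinned version
        if node.node_id in seen_ids or node.text in seen_text:
            continue              # already included
        seen_ids.add(node.node_id)
        seen_text.add(node.text)
        sections.append(f"[{node.node_type} v{node.version}] {node.text}")
    return "\n\n".join(sections)


# Placeholder data: two voice versions and a character that appears twice.
nodes = [
    ContextNode("voice", "VOICE", 1, "Warm and plain-spoken."),
    ContextNode("voice", "VOICE", 2, "Warm, plain-spoken, a little wry."),
    ContextNode("maple", "BOOK_CHARACTER", 1, "Maple, the main character."),
    ContextNode("maple", "BOOK_CHARACTER", 1, "Maple, the main character."),
]
prompt = assemble_prompt("Children's Book playbook text...", nodes,
                         pins={"voice": 1, "maple": 1})
```

Here the resulting prompt contains the playbook, version 1 of the voice, and a single copy of the character node; version 2 of the voice is left out because the session is pinned to version 1.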

NeuroEngine

NeuroEngine drafts.

Quill — our custom 135M-parameter model — runs in your browser via WebGPU. Tokens stream into the chat panel at conversational pace. No round trip, no charge, no rate limit. The same NeuroEngine routes Ship turns to frontier cloud models when you commit.

NeuroCortex

NeuroCortex records what was learned.

The conversation feeds back into your knowledge graph. Tone corrections become candidate STYLE updates. Named characters become BOOK_CHARACTER nodes. Approved outputs become LIBRARY_ITEMs. Every session compounds — without you doing anything explicit to teach the system.
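One way to picture this feedback step is a small rule table that maps conversation events to proposed graph changes. The sketch below is illustrative only: the event kinds, the `Event` and `Proposal` types, and the `"candidate"`/`"committed"` statuses are assumptions; only the node type names STYLE, BOOK_CHARACTER, and LIBRARY_ITEM come from the description above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    kind: str      # hypothetical kinds, see RULES below
    payload: str


@dataclass(frozen=True)
class Proposal:
    node_type: str
    content: str
    status: str    # "candidate" awaits confirmation; "committed" is stored


# Event kind -> (node type, status). The event kinds are made up for this sketch.
RULES = {
    "tone_correction": ("STYLE", "candidate"),
    "character_named": ("BOOK_CHARACTER", "committed"),
    "output_approved": ("LIBRARY_ITEM", "committed"),
}


def learn(events: list[Event]) -> list[Proposal]:
    """Turn conversation events into proposed knowledge-graph changes."""
    proposals = []
    for event in events:
        rule = RULES.get(event.kind)
        if rule is None:
            continue   # events with no mapping are ignored
        node_type, status = rule
        proposals.append(Proposal(node_type, event.payload, status))
    return proposals
```

Treating tone corrections as candidates rather than immediate writes mirrors the "candidate STYLE updates" wording: a single offhand correction does not have to rewrite the stored style.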

Why Drafting Looks Different

Your first draft isn't the model's best work.
It's the NeuroNetwork meeting you.

Other AI tools train you to expect the first response to be the best response. Type a prompt, get a draft, ask for refinement, done. The model is the variable.

RetroHub flips that. The model is a constant. What changes from turn to turn is your working set: the slice of NeuroCortex that NeuroSkills loads as context. Early turns have a less-converged working set than later turns. By the third or fourth draft, the same model is producing dramatically better output, because it finally knows what you actually want.

This is why drafting is free. You're not asking the model to try harder — you're building the context that makes it good. The first draft is the system meeting you. By the third draft, it's working for you.

When you click Ship, NeuroCORE hands that converged context off to a frontier model — and the difference between "chat with GPT" and "chat with the right context loaded" is the entire product.

The Compound Effect

The more you use it, the better it gets.

Every session adds to your NeuroCortex. Every correction tunes your voice. Every project becomes a reference point for the next.

Persistent memory

Your NeuroCortex survives every session. The character you wrote yesterday is still there next month.

Versioned voice

Your VOICE node grows with you. Sessions pin to the version they were created against — no silent drift.

Project continuity

Pick up the children's book you started last summer. Same characters. Same tone. Loaded automatically.

The Business Model

Drafting is free because that's where your brain takes shape.

We don't make money on you exploring. We make money on you shipping.

Always Free

What you get without paying

Unlimited drafting on Quill (NeuroEngine local)
Full NeuroCortex — no node caps
Every NeuroSkill (all 65+ playbooks)
Image generation (ComfyUI)
Export your data anytime

Pay Per Ship

What unlocks when you commit

Frontier model on Ship & Refine (NeuroEngine cloud)
Monthly generation budget
Choice of Grok, Gemini, or Claude
Targeted Refine on shipped outputs
Top-up packs never expire

See tier pricing

Inside NeuroEngine

We don't just resell tokens.
We trained our own model.

RetroHubAI-Quill-135M is our custom-trained draft model, the local-first half of NeuroEngine. It is a LoRA fine-tune of Hugging Face's SmolLM2-135M base, distilled on responses from grok-3-mini and built specifically for the conversational, production-studio voice this product needs.

It runs entirely in your browser via WebGPU + MLC WebLLM — small enough for consumer hardware, fast enough to feel conversational, and tuned for the specific job of drafting alongside NeuroCortex. Every drafting turn at every tier uses it. Free, unlimited, no round trip.

Off-the-shelf 135M models either feel robotic (Instruct variants) or generic (raw base). Quill is neither. It's trained for this product, for this user, for this moment in your conversation.

Model Card

RetroHubAI-Quill-135M

Parameters: 135M
Base: SmolLM2-135M
Method: LoRA distillation
Teacher: xAI Grok (grok-3-mini)
Context: 1024 tokens
Runtime: WebGPU · MLC WebLLM
Cost to user: Free, always

Ready

Build your brain.
Then ship your work.

Drafting is free. NeuroCortex is unlimited. You only pay when you're ready to commit.

Enter the Atelier · Try Free as Guest

No credit card required

Guest mode available

Secure & private
