Train agents on the jobs you should not keep doing.
Reports, audits, follow-ups, dashboards, summaries, checks, and recurring analysis become reusable workflows.
Guided self-serve repo or advisory implementation
Agent OS helps with the work around the "real" work: research, analysis, framing, documentation, presentation, decision support, and the repeatable tasks that drain time before you get to judgment. Paste START_HERE into Cowork, Claude Code, or Codex and the first conversation runs a guided diagnostic that chooses your lane, then builds your vault, dashboard, first agents, and routines through a nine-quest sequence, with watchers and retrieval as optional power tools at the end. The whole setup is built to feel like a forward-deployed engineer working beside you.
Free and public. The repo is the whole kit; click above to get it.
Repetition is the easiest doorway. The larger promise is helping you take on complex work that used to cost too much time, attention, or mental load.
Reports, audits, follow-ups, dashboards, summaries, checks, and recurring analysis become reusable workflows.
They gather sources, compare options, create structure, and give you a first artifact to refine.
Research, decisions, corrections, costs, and handoffs become memory instead of extra admin work.
You decide what matters, what will land with humans, and which recommendation is actually worth pursuing.
Agents collect inputs, read context, rank options, and turn loose ideas into something reviewable.
They compare paths, find risks, draft summaries, and create pages, decks, dashboards, or briefs.
The vault, retrieval, and save-ups keep lessons, decisions, corrections, and next steps available.
You do not burn the best part of your attention on setup, formatting, or rebuilding context.
You can explore more ideas, learn faster, and reach a useful yes or no sooner.
You stay focused on taste, strategy, relationships, and the real-world context agents do not have.
The co-pilot asks what kind of work you do, then recommends Cowork, Claude Code, Codex, or a blended setup.
Vault, handoffs, and lint keep the system from becoming another forgetful chat thread. Retrieval comes later, once the vault has enough worth searching.
Recurring jobs, status, costs, blockers, optional watchers, and agent ownership become visible instead of scattered.
The first wins are practical. From there, Agent OS can help with research, strategy, analysis, and presentation work that used to be hard to start.
Agent job: gather metrics, spot movement, write takeaways, and format a client-ready deck.
Agent job: consolidate exports, flag changes, explain drivers, and produce a summary.
Agent job: review open opportunities, draft next steps, and surface stalled deals.
Agent job: summarize notes, extract owners, update open loops, and prepare the next brief.
Agent job: read history, identify risks, draft a status update, and recommend next actions.
Agent job: gather official sources, compare programs, rank fit, write recommendations, and package a review page.
Agent job: turn a rough idea into a structured map, surface missing pieces, and create a visual artifact for review.
Agent job: organize references, build scene packets, draft options, track feedback, and preserve the creative trail.
The point is not to remove people from the work. It is to move them toward judgment, taste, relationships, and real-world decisions.
Research inputs, organize notes, run audits, update dashboards, compare programs, draft options, format pages, write first-pass presentations, and document what changed.
You decide which idea matters, what the real-world context changes, how it should land with colleagues or clients, and which option is actually worth pursuing.
Each step adds leverage without forcing you to understand the whole system on day one.
Use AI for a single question, draft, summary, or idea.
Store useful files, examples, decisions, style rules, and working notes in the vault.
Turn repeatable jobs and preferred output shapes into routines the agent can run again.
Use agents for research, comparison, planning, analysis, and first-pass artifacts.
Give different agents different jobs and prevent duplicate work.
Track status, costs, context, handoffs, and risks in a visible operating layer.
The unique value is turning AI use into an operating system with memory, continuity, quality control, spend awareness, and a way to keep complex work moving.
Prior work, decisions, files, examples, and handoffs become reusable context.
An optional local search index helps the agent pull relevant memory instead of asking the user to re-explain. No API keys needed.
Long-running work can continue across sessions without losing the plot.
Vault Lint, checks, examples, and evals keep the system connected and trustworthy.
A local HTML dashboard tracks setup progress, agents, and open loops from the first conversation. Connect Notion only if you want it.
You can investigate more ideas, learn faster, and get to a good or bad decision with less manual setup.
Research, decisions, corrections, and next steps become memory instead of disappearing after the session.
The agent does the prep and structure. The human brings taste, context, relationships, and consequence.
Organizer, Deep Worker, Enterprise Operator, Builder, or Team Lead: the diagnostic adapts pacing, examples, and lane to how you actually work.
The agent learns how you want summaries, reports, decks, QA, and follow-up to look.
The same templates, boot prompts, routines, and quality gates adapt to law, real estate, creative, or operations work. Verticalized packs are on the roadmap; the foundation ships today.
Paste START_HERE.md into Cowork, Claude Code, or Codex as your first message. Nine questions decide your lane, then the co-pilot writes the plan and builds through a Q0 to Q8 quest sequence, with a progress file and a local dashboard tracking every step.
The build anchors in a real use case, not a demo, and ends with a test that proves it worked today.
Organizer, Deep Worker, Enterprise Operator, Builder, or Team Lead. The profile sets pacing and examples.
Never, rarely, sometimes, or every day. The answer scores your lane (Cowork, Claude Code, Codex, or a blend), and the score outranks stated preference.
Where you already use AI, your operating system, and any Obsidian, Notion, or project folder you care about, so nothing gets overwritten.
Data and repos the system must avoid. Then the co-pilot proposes your first two agents and asks you to confirm or correct.
The diagnostic recommends the entry point that fits how you want to work. Cowork, Claude Code, Codex, and blended setups can all build the same memory-backed system.
Cowork can run the full setup flow. It feels most like a conversational digital teammate that explains choices, keeps you oriented, and guides the early build.
The co-pilot uses markdown playbooks and a few safe scripts to implement each layer in your own workspace.
Start with the plain-English version. When you want the deeper mechanics, open the stack and see how agents retrieve context, preserve continuity, and stay useful across sessions.
A prompt lands in Cowork, Claude Code, or Codex.
The agent decides which facts are needed: current project, rules, recent messages, or historical context.
With the optional index built, the agent searches it before broad file reads and gets likely source files with snippets.
After retrieval points to a source, the agent reads the specific file or section.
Fresh files, tests, git status, and your latest instruction can override older memory.
The response combines your prompt, retrieved memory, direct file reads, and live workspace state.
The human-readable source of truth for project notes, architecture, decisions, contacts, and session history.
The boot map that tells agents what exists, where key files live, and what state must be preserved.
An optional local index so agents can find relevant memory by keyword match. Semantic search is the documented upgrade path.
Status files, command notes, escalations, and events move work between agents.
A progress file plus a local HTML dashboard keep setup state visible in plain English.
Save-ups turn long conversations into next-version operating packets.
Checks structure so important notes do not become disconnected or invisible.
Security, release, and quality claims need scan output, test results, diffs, or verifier logs.
Credentials stay in environment variables or Keychain. Private chat and inbox content stay out of public collateral.
Retrieval is optional and keyword-based out of the box. Semantic search and richer aggregation are documented upgrade paths.
Mistakes become instruction changes, then get retested against future work.
Recurring lint, retrieval refresh, security sweep, and save-up checks keep the system healthy.
This is what you get by implementing the starter kit: operating routines that make agents remember, check, hand off, and improve.
Shared folders, BRAIN_INDEX, sessions, decisions, handoffs, and VaultBus coordination.
A default HTML quest tracker, with Notion as an optional command-center-style layer.
Agents write handoffs and next-session prompts before context quality drops.
A local starter routine so agents can find approved vault context without bulk-reading.
Checks links, orphans, generated files, and structural issues so the vault stays usable.
Token scans, history checks, watcher boundaries, and public-release gates.
Capture friction, lessons, patches, and retests so the setup improves after breakdowns.
Clear lanes for Cowork, Claude Code, Codex, and hybrid agent teams.
Create the folder structure, BRAIN_INDEX, session registry, and handoff lanes.
Optional local index and search scripts. No dependencies and no API keys out of the box.
Run quality checks so the vault stays usable as it grows.
Review agent output, friction, setup quality, and next improvements.
Set up sweeps for token families, API keys, risky scripts, and release gates.
Scan working tree and git history before anything becomes public.
Create the local quest dashboard first, then add Notion if you want a coordination hub.
Define recurring maintenance, security, lint, and status checks.
Use the guided repo yourself, or reach out to Yacob for advisory services to adapt the same setup to your team, tools, and workflows.
Use the public Agent OS Starter Kit with a setup co-pilot that walks through the decisions, files, and scripts like a forward-deployed engineer by your side.
Work with Yacob to design the polished version for your work, team, or company when you want an expert implementation partner.