AI agents are powerful. Now make them predictable.

Structured workflows that make every agent run repeatable. Scoped steps. Verified outputs. Full observability.


$ npm install -g @athenaflow/cli

Works with Claude Code and Codex today. More harnesses coming.

01 — workflow marketplace

Install workflows like packages.

Browse, install, and share community-built workflows. Each one is scoped, tested, and ready to run. Author your own or install from the marketplace.

playwright-automation

v0.0.21

lespaceman/athena-workflow-marketplace

Writes, reviews, and stabilizes Playwright end-to-end tests from live product flows.

$ athena workflow install playwright-automation

Coverage plans, Test implementation, Flaky-test fixes

robot-automation

v0.0.21

lespaceman/athena-workflow-marketplace

Generates Robot Framework suites with keyword reuse, structured fixtures, and CI-ready runs.

$ athena workflow install robot-automation

Keyword libraries, Suite scaffolding, CI integration

exploratory-testing

v0.0.14

lespaceman/athena-workflow-marketplace

Runs charter-based exploratory sessions, captures evidence, and files actionable issues.

$ athena workflow install exploratory-testing

Session charters, Evidence capture, Triage-ready reports

smoke-testing

v0.0.14

lespaceman/athena-workflow-marketplace

Defines and executes smoke suites against critical paths so breakages surface before release.

$ athena workflow install smoke-testing

Critical-path map, Pre-release smoke run, Pass/fail summary

regression-testing

v0.0.14

lespaceman/athena-workflow-marketplace

Scopes impact-based regression runs, reruns the right areas, and reports deltas against the last release.

$ athena workflow install regression-testing

Impact-based scope, Rerun depth plan, Release-over-release diff

02 — the problem

The problem

The agent is fine.
The workflow around it is broken.

Same prompt. Same model. Different output every time.
Agents drift over long runs. No checkpoints, no way to recover.
What works for one dev doesn’t transfer to the team.
You see the final output. Not the 40 tool calls that produced it.
Agent tooling assumes a human at the keyboard. CI and automation are afterthoughts.

The fix

One workflow layer across every harness.

AthenaFlow sits between you and the agent. Define steps, constraints, and exit criteria once — run on Claude Code, Codex, or any future harness. Same workflow, same result.

01

Structure over prompting

Define jobs declaratively — steps, constraints, exit criteria — not conversationally.

02

Checkpoints, not hope

Every step is verifiable, every run is resumable. No silent drift.

03

Team-portable

What one developer builds transfers to the whole team. Same workflow, same result.

04

Pipeline-first

Built for CI from day one. Not interactive-first, retrofitted later.
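The first two principles above (declarative steps with exit criteria, and checkpoints that make a run resumable) can be illustrated with a minimal sketch. Everything here is hypothetical: `Step`, `Checkpoint`, and `runWorkflow` are names invented for this example and are not AthenaFlow's actual schema or API.

```typescript
// Hypothetical shapes for illustration only; AthenaFlow's real schema may differ.
interface Step {
  id: string;
  // Work performed by the step; receives the shared state and returns a new one.
  run: (state: Record<string, unknown>) => Record<string, unknown>;
  // Exit criterion: the step only counts as done when this returns true.
  verify: (state: Record<string, unknown>) => boolean;
}

interface Checkpoint {
  completed: string[];            // ids of steps that passed verification
  state: Record<string, unknown>; // state after the last verified step
}

function runWorkflow(
  steps: Step[],
  from: Checkpoint = { completed: [], state: {} },
): Checkpoint {
  let checkpoint: Checkpoint = {
    completed: [...from.completed],
    state: { ...from.state },
  };
  for (const step of steps) {
    // On resume, steps that were already verified are skipped.
    if (checkpoint.completed.includes(step.id)) continue;
    const next = step.run(checkpoint.state);
    // A failed exit criterion stops the run at the last good checkpoint.
    if (!step.verify(next)) return checkpoint;
    checkpoint = { completed: [...checkpoint.completed, step.id], state: next };
  }
  return checkpoint;
}

// Usage: the second step fails verification once, then passes on resume.
let attempts = 0;
const steps: Step[] = [
  {
    id: "plan",
    run: (s) => ({ ...s, plan: ["login", "checkout"] }),
    verify: (s) => Array.isArray(s.plan),
  },
  {
    id: "implement",
    run: (s) => {
      attempts += 1;
      return { ...s, tests: attempts > 1 ? 2 : 0 };
    },
    verify: (s) => s.tests === 2,
  },
];

const first = runWorkflow(steps);         // completed: ["plan"]
const second = runWorkflow(steps, first); // completed: ["plan", "implement"]
```

The point of the design is that a failed step never advances the checkpoint, so a resumed run repeats only the unverified work rather than the whole job.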

03 — how it works

Install, run, ship.
Three steps.

Pick a workflow, run it, ship the result. Athena handles everything in between.

01. Pick a workflow

From marketplace or from scratch

Install a community workflow, point to a local file, or author one from scratch. One command to get started.

$ athena workflow install e2e-tests

✓ e2e-tests@1.2.0 installed

02. Run it

Run it anywhere. Athena handles the loop.

Interactive or headless. Athena manages permissions, checkpoints, and progress while the agent works.

$ athena-flow --workflow=e2e-tests

▸ claude-code | running

▸ 12 tool calls | 0 denied

03. Ship the output

Artifacts land where you need them

Every run is saved. Resume with full state. Pipe structured JSONL to files, CI artifacts, or downstream tools.

$ athena-flow resume

▸ restoring session abc-123

✓ saved to ./output/risk.md
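As a rough idea of what a downstream tool might do with JSONL output, the sketch below counts tool calls and denials from a stream of events, similar to the summary line shown in step 02. The `RunEvent` shape and its `type` values are assumptions made for this example; the fields the CLI actually emits are not specified on this page.

```typescript
// Hypothetical event shape; the real JSONL fields are not documented here.
interface RunEvent {
  type: "tool_call" | "permission_denied" | "step_done";
  tool?: string;
}

// Parse one JSON object per line and tally the events of interest.
function summarize(jsonl: string): { toolCalls: number; denied: number } {
  const summary = { toolCalls: 0, denied: 0 };
  for (const line of jsonl.split("\n")) {
    if (line.trim() === "") continue; // tolerate blank lines and a trailing newline
    const parsed = JSON.parse(line) as RunEvent;
    if (parsed.type === "tool_call") summary.toolCalls += 1;
    if (parsed.type === "permission_denied") summary.denied += 1;
  }
  return summary;
}

// Made-up sample data standing in for a saved run log.
const sample =
  [
    '{"type":"tool_call","tool":"read_file"}',
    '{"type":"tool_call","tool":"run_tests"}',
    '{"type":"step_done"}',
  ].join("\n") + "\n";

const result = summarize(sample); // { toolCalls: 2, denied: 0 }
```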

04 — capabilities

One layer across every harness.

Each harness has its own session format, plugin system, and event output. Athena unifies them into a single workflow model.

Declarative workflows

One file defines prompts, plugins, isolation, and model preferences — portable across any harness. Version and share like code.

Real-time event feed

A unified observability layer. Every tool call, permission, and decision streams live — same format, any harness.

Unified session model

One session format across harnesses. Resume, inspect, and replay any run regardless of which agent executed it.

Harness-agnostic

One workflow definition, with the same steps, constraints, and exit criteria on Claude Code, Codex, and future harnesses.

CI-native execution

One headless interface for any harness. JSONL output, structured exit codes, and permission policies for your pipeline.

Portable plugins

Each harness has its own plugin system. Athena provides a single interface to author and run plugins across all of them.
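One common way to get a single event format out of several harnesses is an adapter per harness that translates native records into a shared shape. The sketch below shows that pattern under stated assumptions: `UnifiedEvent`, `HarnessAdapter`, and both raw formats are invented for illustration and do not describe the real output of Claude Code, Codex, or AthenaFlow.

```typescript
// Shared event shape that every adapter must produce (hypothetical).
interface UnifiedEvent {
  harness: string;
  kind: "tool_call" | "permission";
  detail: string;
}

// Each adapter translates one harness's native records into UnifiedEvent.
interface HarnessAdapter<Raw> {
  harness: string;
  normalize(raw: Raw): UnifiedEvent;
}

// Two made-up native formats, standing in for real harness output.
type RawA = { tool_name: string };
type RawB = { action: { type: string; target: string } };

const adapterA: HarnessAdapter<RawA> = {
  harness: "harness-a",
  normalize: (raw) => ({
    harness: "harness-a",
    kind: "tool_call",
    detail: raw.tool_name,
  }),
};

const adapterB: HarnessAdapter<RawB> = {
  harness: "harness-b",
  normalize: (raw) => ({
    harness: "harness-b",
    // In this invented format, an "ask" action is a permission request.
    kind: raw.action.type === "ask" ? "permission" : "tool_call",
    detail: raw.action.target,
  }),
};

// Consumers see one feed regardless of which harness produced each event.
const feed: UnifiedEvent[] = [
  adapterA.normalize({ tool_name: "read_file" }),
  adapterB.normalize({ action: { type: "ask", target: "write_file" } }),
];
```

Keeping the translation at the edge means session storage, replay, and CI reporting only ever need to understand the one shared shape.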

Start building.

Install the CLI, run the setup wizard, pick a workflow.

$ npm install -g @athenaflow/cli
