HTTP API

The CTXone Hub exposes a REST API over HTTP when run with --http. This doc lists every endpoint, its request format, response format, and any query parameters.

All endpoints live under http://<host>:<port>/api/. Default host and port: 0.0.0.0:3001. CORS is enabled with Allow-Origin: *.

Conventions

Branch/ref parameter: most endpoints take a branch name in the URL path ({ref_name}) or as a ref query string / body field. Defaults to main.
Content type: requests and responses use application/json.
Error responses: HTTP 4xx for bad input, 5xx for server errors. Body is plain text with a human-readable message.

Health

`GET /api/health`

Simple liveness check.

Response (200):

{
  "status": "ok",
  "service": "ctxone-hub"
}

Used by ctx status and ctx doctor.

Stats

`GET /api/stats/tokens`

Cumulative token savings aggregated across every session.

Response (200):

{
  "session_id": "_aggregate",
  "session_tokens_used": 98,
  "session_tokens_saved": 1706,
  "total_graph_size_chars": 1804,
  "total_graph_size_tokens": 451,
  "cumulative_ratio": 18.43
}

session_id — always "_aggregate" to signal this is a roll-up, not a single-session snapshot
session_tokens_used — sum of tokens actually sent across all sessions
session_tokens_saved — sum of (recalls × flat_baseline) - used across all sessions
total_graph_size_chars — max observed across sessions (graph size is process-global, not summable)
total_graph_size_tokens — chars ÷ 4
cumulative_ratio — (used + saved) / used

`GET /api/stats/tokens/{session_id}`

Stats for a single logical session. session_id is whatever clients pass in the X-CTXone-Session header; absent clients roll up under "default".

Response (200):

{
  "session_id": "[email protected]",
  "session_tokens_used": 42,
  "session_tokens_saved": 658,
  "total_graph_size_chars": 1804,
  "total_graph_size_tokens": 451,
  "cumulative_ratio": 16.67
}

Returns 404 if the session ID has never been seen. Sessions are created lazily the first time a read endpoint (recall, context) records token usage for them.

`GET /api/stats/sessions`

List every known session with its current stats.

Response (200):

[
  { "session_id": "[email protected]", "session_tokens_used": 42, "session_tokens_saved": 658, "total_graph_size_chars": 1804, "total_graph_size_tokens": 451, "cumulative_ratio": 16.67 },
  { "session_id": "[email protected]",   "session_tokens_used": 120, "session_tokens_saved": 1200, "total_graph_size_chars": 1804, "total_graph_size_tokens": 451, "cumulative_ratio": 11.00 },
  { "session_id": "default",           "session_tokens_used": 0,   "session_tokens_saved": 0,    "total_graph_size_chars": 1804, "total_graph_size_tokens": 451, "cumulative_ratio": 0.0 }
]

Sorted by session_id. The "default" session is always present even on a fresh Hub.

`POST /api/stats/llm_usage`

Record one LLM turn’s token usage against the caller’s session. Agents call this after each significant LLM turn with numbers copied straight from the provider response’s usage field. Returns the updated SessionSnapshot so callers see running totals in one round trip.

The session is resolved via X-CTXone-Session (same mechanism as every other endpoint). Unknown sessions are auto-created.

Request body:

{
  "input_tokens": 2400,
  "output_tokens": 450,
  "cache_read_tokens": 1800,
  "cache_create_tokens": 600,
  "model": "claude-sonnet-4.5",
  "provider": "anthropic"
}

input_tokens (required) — tokens the model consumed as input
output_tokens (required) — tokens the model generated
cache_read_tokens — tokens served from the prompt cache (Anthropic), default 0
cache_create_tokens — tokens written to the prompt cache (Anthropic), default 0
model — human-readable model identifier for display, optional
provider — provider identifier (anthropic, openai, gemini, …), optional

All token fields are u64; negative or malformed values are rejected by the JSON parser.

Response (200):

{
  "session_id": "[email protected]",
  "session_tokens_used": 12,
  "session_tokens_saved": 340,
  "total_graph_size_chars": 1804,
  "total_graph_size_tokens": 451,
  "cumulative_ratio": 29.33,
  "llm_input_tokens": 2400,
  "llm_output_tokens": 450,
  "llm_cache_read_tokens": 1800,
  "llm_cache_create_tokens": 600,
  "llm_call_count": 1,
  "last_model": "claude-sonnet-4.5",
  "last_provider": "anthropic"
}

Error responses:

400 Bad Request (or 422 Unprocessable Entity, depending on axum’s extractor) when input_tokens or output_tokens are missing, non-numeric, or negative.

Recall integration: once a session has reported LLM usage at least once, every subsequent GET /api/memory/recall from the same session carries a session_llm_stats sub-object so agents see the running totals alongside the results:

{
  "results": [...],
  "ctx_tokens_sent": 300,
  "ctx_tokens_estimated_flat": 1500,
  "ctx_savings_ratio": 5.0,
  "pinned_count": 2,
  "topic_matches": 3,
  "session_llm_stats": {
    "input_tokens_total": 12500,
    "output_tokens_total": 3200,
    "cache_read_tokens_total": 8900,
    "cache_create_tokens_total": 450,
    "call_count": 17
  }
}

The field is only present for sessions that have reported usage — sessions that haven’t see the same shape they’ve always seen.

`GET /api/stats/{ref_name}`

Structural stats for a branch.

Response (200):

{
  "commit_count": 27,
  "path_count": 21,
  "branch_count": 2,
  "epoch_count": 0,
  "agents": ["ctxone", "ctxone-prime"],
  "categories": ["Checkpoint", "Custom(\"Observe\")"],
  "latest_commit": {
    "id": "sg_e762325fed96",
    "timestamp": "2026-04-14T17:47:43Z",
    "agent": "ctxone",
    "intent": "fact description"
  }
}

Read endpoints (state)

`GET /api/state/{ref_name}?path=<path>`

Read a value at a specific path.

Query params:

path — JSON path to read (default: /)

Response (200): the value at that path, pretty-printed JSON.

`GET /api/state/{ref_name}/paths?prefix=<prefix>&max_depth=<n>`

List all paths under a prefix.

Query params:

prefix — path prefix (default: /)
max_depth — max tree depth (default: 50)

Response (200): array of path strings.

["/memory/licensing/abc", "/memory/architecture/def", ...]

`GET /api/state/{ref_name}/search?query=<q>&max_results=<n>`

Literal substring search across values and keys.

Query params:

query — substring to match (case-insensitive)
max_results — max results (default: 50)

Response (200):

[
  {"path": "/memory/licensing/abc", "value": "CTXone uses BSL-1.1"},
  ...
]

Log and blame

`GET /api/log/{ref_name}?limit=<n>`

Recent commit history.

Query params:

limit — max commits (default: 20)

Response (200): array of commits. See the log response schema in CLI_REFERENCE.md.

`GET /api/blame/{ref_name}?path=<path>`

Provenance chain for a specific path.

Query params:

path — path to blame

Response (200): array of blame entries with commit id, agent, timestamp, intent, and confidence.

`GET /api/diff?ref_a=<a>&ref_b=<b>`

Diff two refs.

Query params:

ref_a — first ref (usually older / base)
ref_b — second ref (usually newer / target)

Response (200):

{
  "ref_a": "main",
  "ref_b": "experiment",
  "ops": [
    {"op": "AddKey", "path": "/memory/test", "key": "abc", "value": "..."},
    {"op": "SetValue", "path": "/...", "old": {...}, "new": {...}},
    {"op": "RemoveKey", "path": "/...", "key": "..."}
  ]
}

Op tags: SetValue, AddKey, RemoveKey, AppendItem, RemoveItem.

Branches

`GET /api/branches`

List all branches.

Response (200):

[
  {"name": "main", "id": "sg_e762..."},
  {"name": "experiment", "id": "sg_a3b1..."}
]

`POST /api/branches`

Create a new branch.

Request body:

{
  "name": "experiment",
  "from": "main"
}

Response (200):

{
  "status": "ok",
  "name": "experiment",
  "from": "main",
  "commit_id": "sg_a3b1..."
}

Memory endpoints (the high-level API)

These are the endpoints CTXone’s memory layer adds on top of the underlying state primitives.

`POST /api/memory/remember`

Store a fact.

Request body:

{
  "fact": "CTXone uses BSL-1.1 licensing",
  "importance": "high",
  "context": "licensing",
  "tags": ["legal", "decision"],
  "ref": "main"
}

fact (required) — the string to store
importance — high / medium / low (default medium). Maps to confidence 0.95/0.7/0.4.
context — category name; storage path is /memory/<context>/<id>
tags — queryable tags stored on the commit
ref — branch to write to (default main)

Response (200):

{
  "status": "ok",
  "ref": "main",
  "path": "/memory/licensing/18a6...",
  "commit_id": "sg_e762..."
}

`POST /api/memory/forget`

Delete a memory at a specific path.

Request body:

{
  "path": "/memory/licensing/18a6...",
  "reason": "superseded by new policy",
  "ref": "main"
}

Marked in blame as a Rollback intent with the given reason.

Response (200):

{
  "status": "ok",
  "ref": "main",
  "path": "/memory/licensing/18a6...",
  "commit_id": "sg_next..."
}

`GET /api/memory/recall?topic=<t>&budget=<n>&ref=<r>`

Retrieve memories for a topic. Pinned-first, token-scored, budget-capped.

Query params:

topic — query string (tokenized, multi-word supported)
budget — max token budget (default 1500)
ref — branch (default main)

Response (200): see the recall response schema in CLI_REFERENCE.md.

Every recall updates the session token counters — each call’s sent contributes to session_tokens_used on GET /api/stats/tokens.

`GET /api/memory/context/{project}?ref=<r>`

Load the full context tree for a project.

Response (200):

{
  "project": "myproject",
  "ref": "main",
  "context": {
    "status": "active",
    "decisions": {...}
  },
  "ctx_tokens_sent": 234,
  "ctx_tokens_estimated_flat": 1191
}

`POST /api/memory/prime`

Load structured sections as pinned or searchable memory.

Request body:

{
  "source": "project",
  "pinned": true,
  "sections": [
    {"title": "The Insight", "body": "..."},
    {"title": "The Roadmap", "body": "..."}
  ],
  "ref": "main"
}

source (required) — group name; re-priming the same source overwrites
pinned — if true, always include in recall; otherwise searchable (default false)
sections — parsed markdown sections from the client
ref — branch (default main)

Response (200):

{
  "status": "ok",
  "ref": "main",
  "source": "project",
  "pinned": true,
  "sections_written": 5,
  "paths": [
    "/memory/pinned/project/the-insight",
    "/memory/pinned/project/the-roadmap",
    ...
  ]
}

`GET /api/memory/pinned`

List all pinned memories.

Response (200):

[
  {"path": "/memory/pinned/project/the-insight/title", "value": "The Insight"},
  {"path": "/memory/pinned/project/the-insight/body", "value": "..."},
  ...
]

Clients typically group these by /memory/pinned/<source>/<slug> and pair the /title and /body children to reconstruct structured sections. Returns an empty array (not 404) when no pinned memories exist.

`POST /api/memory/summarize_session`

End-of-session commit capturing what was learned.

Request body:

{
  "session_id": "2026-04-14-afternoon",
  "key_points": ["Shipped Postgres backend", "Built auth middleware"],
  "decisions": ["SaaS as on-ramp", "agent memory is top priority"]
}

Response (200):

{
  "status": "ok",
  "session_id": "2026-04-14-afternoon",
  "key_points": 2,
  "decisions": 2
}

`GET /api/memory/what_changed_since?since=<iso>`

Recent commits filtered to those after a timestamp.

Query params:

since — ISO 8601 timestamp (e.g., 2026-04-12T00:00:00Z)

Response (200): array of commit summaries.

`GET /api/memory/why_did_we?decision=<text>`

Search for a decision and return its blame chain.

Query params:

decision — substring of the decision to look up

Response (200):

{
  "decision": "use BSL-1.1",
  "traces": [
    {
      "path": "/memory/licensing/abc",
      "blame": [...]
    }
  ]
}

Error responses

Status	Meaning	Example body
400	Malformed request (missing required field)	`"missing field \`fact`“`
404	Path or ref not found	`"ref not found: experiment"`
500	Internal error (storage, engine)	`"tree error: ..."`

The body is plain text, not JSON. Clients should log and retry on 5xx.

Rate limiting

The Hub enforces a per-peer-IP token-bucket rate limit in HTTP mode. Default: 600 requests/minute per IP (permissive — catches runaway loops without bothering real agents).

Clients that exceed the bucket get:

HTTP/1.1 429 Too Many Requests
Retry-After: 3
X-RateLimit-Limit: 600
X-RateLimit-Remaining: 0

Configure via --rate-limit-rpm <N> or the CTXONE_RATE_LIMIT_RPM env var. 0 disables rate limiting entirely. See docs/TROUBLESHOOTING.md#rate-limiting for details.

Per-session token tracking

Send X-CTXone-Session: <id> on any request to have its token usage counted under that session. Absent the header, usage rolls up under the "default" session. Per-session stats are exposed via:

GET /api/stats/tokens/{session_id} — single-session snapshot
GET /api/stats/sessions — all sessions
GET /api/stats/tokens — cross-session aggregate (backward-compat)

The Python client accepts a session_id constructor arg or reads CTX_SESSION_ID from the environment.

Per-tool agent IDs

Send X-CTXone-Agent: <name> on any write request (remember/forget/prime/summarize_session/merge) to stamp the commit with that agent ID. ctx blame and /api/log/{ref} responses surface this as agent_id, so you can tell which tool wrote each fact.

Absent the header, commits are attributed to "ctxone". The Python client accepts an agent_id constructor arg or reads CTX_AGENT_ID from the environment; the Hub binary accepts --agent-id <name> for MCP stdio mode (which is what ctx init wires into the generated .mcp.json / .cursor/mcp.json etc).

See docs/TROUBLESHOOTING.md#per-tool-agent-ids for the full resolution order and examples.

Plan endpoints

All plan endpoints live under /api/plans/* and honor X-CTXone-Agent for blame attribution + X-CTXone-Session for stats. A ref query parameter selects the branch (default main).

`POST /api/plans`

Create a plan.

POST /api/plans
{
  "name": "website-v2",
  "description": "Brand pivot",
  "ref": "main"
}
→ 201 Created
{
  "name": "website-v2",
  "description": "Brand pivot",
  "status": "active",
  "created_by": "claude-code",
  "created_at": "2026-04-16T…",
  "task_counts": { "pending": 0, "in_progress": 0, "done": 0, "abandoned": 0, "total": 0 }
}
→ 409 Conflict  (plan already exists)

`GET /api/plans?ref=main&status=active`

List plans on a branch, optionally filtered by status. Response body is a JSON array of plan objects.

`GET /api/plans/{name}?ref=main`

Fetch one plan with its full tasks[] list.

`DELETE /api/plans/{name}?ref=main`

Remove a plan destructively. Use POST /api/plans/{name}/archive for a soft, reversible alternative.

`POST /api/plans/{name}/tasks`

Add a task. Body fields:

Field	Type	Required
`title`	string	yes
`description`	string	no
`priority`	`low`/`medium`/`high`/`critical`	no
`parent_id`	string	no (subtask support)
`assigned_to`	string	no — agent id
`blocked_by`	string[]	no
`ref`	string	no

Returns the created task on 201.

`GET /api/plans/{name}/tasks?ref=main`

List tasks in a plan, flat.

`GET /api/plans/{name}/tasks/{task_id}?ref=main`

Fetch a single task.

`POST /api/plans/{name}/tasks/{task_id}/start`

Transition pending → in_progress. Returns the updated task. Returns 409 Conflict if blockers aren’t done.

`POST /api/plans/{name}/tasks/{task_id}/complete`

Transition in_progress → done with a proof:

{ "proof": { "kind": "commit", "value": "ef6ce63" } }

Proof kind is one of commit / file / test / text. Returns 400 Bad Request when the proof value is empty or the kind is unknown.

`POST /api/plans/{name}/tasks/{task_id}/abandon`

Body: { "reason": "superseded" }. Reason is required (empty reasons return 400).

`POST /api/plans/{name}/archive`

Soft-archive a plan.

`GET /api/plans/{name}/next?ref=main&assigned_to=me&include_unassigned=true&assigned_only=false`

Return the highest-priority pickable task wrapped as { "task": { … } } or { "task": null }. Pass assigned_to=me to filter to tasks assigned to the agent carried by X-CTXone-Agent — this is the state-driven orchestration primitive.

Authentication

The HTTP API currently has no authentication. Run the Hub on a trusted network (loopback, VPN, or private subnet) or put a reverse proxy in front with whatever auth layer you already use.

Multi-tenant auth is tracked as future work — see the engine’s agentstategraph-mcp binary, which supports --auth and --keys-file for tenant isolation. CTXone Hub doesn’t currently expose these.

HTTP API

Conventions

Health

GET /api/health

Stats

GET /api/stats/tokens

GET /api/stats/tokens/{session_id}

GET /api/stats/sessions

POST /api/stats/llm_usage

GET /api/stats/{ref_name}

Read endpoints (state)

GET /api/state/{ref_name}?path=<path>

GET /api/state/{ref_name}/paths?prefix=<prefix>&max_depth=<n>

GET /api/state/{ref_name}/search?query=<q>&max_results=<n>

Log and blame

GET /api/log/{ref_name}?limit=<n>

GET /api/blame/{ref_name}?path=<path>

GET /api/diff?ref_a=<a>&ref_b=<b>

Branches

GET /api/branches

POST /api/branches

Memory endpoints (the high-level API)

POST /api/memory/remember

POST /api/memory/forget

GET /api/memory/recall?topic=<t>&budget=<n>&ref=<r>

GET /api/memory/context/{project}?ref=<r>

POST /api/memory/prime

GET /api/memory/pinned

POST /api/memory/summarize_session

GET /api/memory/what_changed_since?since=<iso>

GET /api/memory/why_did_we?decision=<text>

Error responses

Rate limiting

Per-session token tracking

Per-tool agent IDs

Plan endpoints

POST /api/plans

GET /api/plans?ref=main&status=active

GET /api/plans/{name}?ref=main

DELETE /api/plans/{name}?ref=main

POST /api/plans/{name}/tasks

GET /api/plans/{name}/tasks?ref=main

GET /api/plans/{name}/tasks/{task_id}?ref=main

POST /api/plans/{name}/tasks/{task_id}/start

POST /api/plans/{name}/tasks/{task_id}/complete

POST /api/plans/{name}/tasks/{task_id}/abandon

POST /api/plans/{name}/archive

GET /api/plans/{name}/next?ref=main&assigned_to=me&include_unassigned=true&assigned_only=false

Authentication

See also

`GET /api/health`

`GET /api/stats/tokens`

`GET /api/stats/tokens/{session_id}`

`GET /api/stats/sessions`

`POST /api/stats/llm_usage`

`GET /api/stats/{ref_name}`

`GET /api/state/{ref_name}?path=<path>`

`GET /api/state/{ref_name}/paths?prefix=<prefix>&max_depth=<n>`

`GET /api/state/{ref_name}/search?query=<q>&max_results=<n>`

`GET /api/log/{ref_name}?limit=<n>`

`GET /api/blame/{ref_name}?path=<path>`

`GET /api/diff?ref_a=<a>&ref_b=<b>`

`GET /api/branches`

`POST /api/branches`

`POST /api/memory/remember`

`POST /api/memory/forget`

`GET /api/memory/recall?topic=<t>&budget=<n>&ref=<r>`

`GET /api/memory/context/{project}?ref=<r>`

`POST /api/memory/prime`

`GET /api/memory/pinned`

`POST /api/memory/summarize_session`

`GET /api/memory/what_changed_since?since=<iso>`

`GET /api/memory/why_did_we?decision=<text>`

`POST /api/plans`

`GET /api/plans?ref=main&status=active`

`GET /api/plans/{name}?ref=main`

`DELETE /api/plans/{name}?ref=main`

`POST /api/plans/{name}/tasks`

`GET /api/plans/{name}/tasks?ref=main`

`GET /api/plans/{name}/tasks/{task_id}?ref=main`

`POST /api/plans/{name}/tasks/{task_id}/start`

`POST /api/plans/{name}/tasks/{task_id}/complete`

`POST /api/plans/{name}/tasks/{task_id}/abandon`

`POST /api/plans/{name}/archive`

`GET /api/plans/{name}/next?ref=main&assigned_to=me&include_unassigned=true&assigned_only=false`