What Is OpenAI Symphony? Codex Orchestration, Issue-Driven Development, and AI Agent Workflows

OpenAI recently open-sourced an interesting Codex orchestration specification: Symphony.

It is not another chat-based coding assistant, nor is it a complete new IDE. More precisely, Symphony is a way to orchestrate work around Codex: it turns an issue tracker similar to Linear into the control plane for coding agents, so every open task can correspond to a continuously running Agent.

One line from the official article captures its direction well: in the past, engineers had to monitor multiple Codex sessions at once, continually assigning work, reviewing output, correcting course, and restarting sessions. Symphony is designed to address exactly that context-switching bottleneck.

Symphony is not solving code writing, but Agent management

A single Codex session works well for interactive development: you give it a task, it changes the code, you review it, and then you keep asking follow-up questions. But once a team starts using multiple Agents at the same time, the problem shifts from “can the code be written?” to “who is working on what, how far along is it, and who takes over after a failure?”

OpenAI’s approach is to move the center of gravity from “sessions” to “tasks”:

the issue is the real unit of work;
every open issue can map to an independent Agent workspace;
Symphony continuously polls the task board and decides which tasks should be started, retried, stopped, or reclaimed;
Codex performs implementation, testing, commits, PR creation, status updates, and related actions inside the workspace;
humans no longer micromanage every session, but instead review results, adjust goals, and maintain boundaries.

The shift behind this is important: an Agent is no longer just a tool that humans temporarily summon, but a continuously running kind of executor inside the development workflow.

Why an issue tracker?

Because teams already use issue trackers to manage real work.

Requirements, bugs, refactors, migrations, research, priorities, blockers, owners, and milestones are already recorded in Linear, GitHub Issues, or similar systems. Symphony does not reinvent a large console. Instead, it treats these existing systems as the task entry point for Agents.

This has several advantages:

Work does not need to be copied from an issue into a chat window.
Humans can keep creating, splitting, scheduling, and closing tasks in familiar ways.
Agent state changes can be written back to the same work system, making async collaboration easier for the team.
Task dependencies can naturally form a DAG, allowing unblocked tasks to move forward in parallel.

If traditional CI is “automation after a code commit,” Symphony is closer to “automation after an issue is created.”

Its core workflow

A typical Symphony flow can be understood as:

1
2
3
4
5
6
7
8


创建 issue
  -> Symphony 轮询到可执行任务
  -> 为该 issue 创建独立 workspace
  -> 启动 Codex agent session
  -> Agent 阅读任务、修改代码、运行测试
  -> 创建或更新 PR
  -> 写回任务状态、评论、证据和交付物
  -> 人类 review、合并或要求修改

The official specification also emphasizes several engineering details:

each issue uses an independent workspace to reduce cross-contamination;
the orchestrator maintains retry, concurrency, and recovery state;
workflow policy lives in the repository’s WORKFLOW.md, so teams can version the rules that describe how Agents should handle tasks;
implementations need to preserve observability, with at least structured logs;
a successful state does not have to be Done; it can also be an intermediate state handed to humans for review.

This shows that Symphony is not simply about “letting AI write code automatically.” It defines a runnable, recoverable, and auditable Agent work system.

Goal-driven, not a rigid state machine

OpenAI mentions an important shift in the article: early on, they tried hard-coding many actions in the outer harness, such as committing code, running tests, and handling GitHub workflows. But as Codex became more capable, that approach started to constrain the Agent.

The later direction was to give the Agent a goal, rather than encoding every step as a fixed state transition.

For example, a task’s goal might be “complete the Vite migration and ensure CI passes.” The Agent can decide for itself whether it needs to change configuration, fix tests, read CI logs, handle review feedback, or even create new follow-up issues. Symphony provides boundaries, context, and the runtime framework instead of prescribing every action for the Agent.

This is also where it differs from traditional automation scripts: scripts are good at repeated, deterministic processes; Symphony is aimed at engineering tasks with uncertainty.

How is this different from normal Codex usage?

A normal Codex session is more like “a human writes code with AI”:

the human opens a session;
the human describes the task;
the human watches the output;
the human corrects course at any time;
after one task ends, the human starts the next session.

Symphony is more like “a team hands a task pool to a group of Agents”:

humans write clear issues;
the system continuously discovers executable tasks;
Agents make progress in isolated environments;
results come back as PRs, comments, test status, videos, or analysis reports;
humans review at key checkpoints.

This is not about replacing engineers. It is about freeing engineers from the burden of simultaneously watching many sessions. OpenAI notes in the official article that some teams saw a significant increase in PRs merged to the main branch. But the more important point is the change in working style: the startup cost of trying an idea, launching a refactor, or validating a hypothesis becomes lower.

Where does it fit?

Symphony is better suited to tasks such as:

routine feature implementation;
small refactors in an existing codebase;
infrastructure migrations;
dependency upgrades;
filling in tests;
CI fixes;
research followed by an implementation plan;
continuing to revise a PR based on review feedback.

It is not necessarily a good fit for highly ambiguous tasks that require strong business judgment or architectural decisions. For those problems, an interactive Codex session is still more natural because humans need to stay involved throughout the process.

Risks and boundaries

Symphony is appealing, but in real adoption, teams cannot look only at the “automation” side.

Several boundaries need to be made clear in advance:

issues must be written clearly, otherwise Agents will amplify vague requirements into incorrect implementations;
Agent permissions should be constrained, especially access to repositories, secrets, production environments, and third-party services;
every workspace should be isolated to avoid contamination between tasks;
CI, tests, lint, and review remain necessary quality gates;
task status, PR links, logs, and failure reasons need to be traceable;
human review cannot be skipped, especially for changes involving security, billing, data migration, and permission logic.

The official repository also positions Symphony as an engineering preview and reference implementation for trusted environments, not a finished platform that can blindly replace a development process.

My understanding of Symphony

The most valuable part of Symphony is not that it uses Linear, nor that the reference implementation chose Elixir. Its value is that it redefines the entry point for programming Agents.

In the past, we were used to starting AI coding from a chat window. That is flexible, but once the scale grows, human attention becomes the bottleneck. Symphony puts the entry point back in the issue tracker and lets Agents work continuously around real tasks. In that sense, AI coding starts moving from a “personal productivity tool” toward “team workflow infrastructure.”

If you are already using Codex, Claude Code, Cursor Agent, or similar tools, the most important thing to notice about Symphony is not any specific implementation, but the pattern behind it:

Do not only manage Agent sessions. Manage the work that needs to be done.

This may become a key dividing line for the next stage of AI coding tools.