Codex: Super App Unifying AI Agents and Workflows

Riley Brown convinces skeptic Greg Isenberg that OpenAI's Codex, powered by GPT 5.5, outperforms Claude by combining coding, docs, browser control, automations, and Remotion videos in one GUI interface.

Codex as the Ultimate AI Super App

Riley Brown positions OpenAI's Codex—accessible via any ChatGPT subscription—as the strongest single interface for AI agents today, running on the newly released GPT 5.5 model. Unlike fragmented tools, Codex handles vibe coding (intuitive app building via natural language), knowledge work (spreadsheets, charts, Word docs, PowerPoint decks), browser automation, computer control, and scheduled automations all in one place. Brown demos creating a physics-based train simulator app complete with crash counters in one prompt, exporting decks to Canva, and generating charts from research data. He argues this eliminates context-switching: "Codex is the fastest way to do the most amount of things."

Greg Isenberg enters as a Claude Code loyalist, admitting he's never downloaded Codex and prefers his stack. Brown counters by showing how Codex stacks Claude Code inside its terminal—type claude to run Anthropic's model alongside GPT 5.5, leveraging each where it shines. Brown's team of seven engineers has fully switched, citing Codex's edge on complex infrastructure tasks, like one-shotting a mobile vibe-coding tool (Repl.it clone) in 80 minutes on GPT 5.4.

GUI Interfaces Outpace Terminals for Broad Adoption

Brown traces the evolution from 2024's terminal UIs (TUIs) like early Claude Code to 2025's dominant GUI pattern: chats on the left, agent in the middle, output on the right. He compares Codex favorably to Cursor and Claude's desktop app, which mirror this layout but split functionalities—Claude separates Cowork (business/docs) from Claude Code (coding), with differing permissions and limits. "I do not like Claude's decision to split up Cowork and Claude Code," Brown says, noting Cowork's restrictions frustrate agentic workflows.

For non-engineers, GUIs lower barriers: no terminals, no manual skill files. Brown creates projects as folders (e.g., "Startup Ideas Podcast"), auto-organizing chats with blue dots for completed tasks and spinners for active ones. Multitasking shines—spawn chats via Cmd+N, monitor progress like in Manis. Isenberg agrees business users want simplicity: "People in business just want an easier interface to do all of these agentic workflows."

Codex unifies primitives: vibe code an app, then pivot to docs without switching apps. Brown critiques Cursor for spitting out HTML previews instead of native doc views, and dismisses Claude Cowork as restrictive despite its potential.

Breakthrough Features: Browser, Remotion, Chronicle, and Plugins

Codex integrates OpenAI's Atlas browser directly, evolving into a task-specific web environment with login persistence. Brown envisions it replacing tab-cluttered browsers: open Notion via plugin, have AI edit while viewing live. Speed has hit a threshold—chess demo plays at near-human pace, ditching the "dial-up" feel of prior agents. By year-end, Brown predicts human-parity speed.

Remotion plugin turns code into motion graphics: "@Remotion create a video" generates timelines, compositions, and exports high-quality clips. Brown pulls brand assets (logos, colors, fonts) via a custom "internet image puller" skill, enabling one-shot launch videos with 800k+ views. He shares a demo video scripted entirely by AI, stressing simplicity: "Never have multiple things happening at once."

Chronicle, released days prior, adds screen-watching memory for computer use—AI controls apps like Canva, exports files, loops results back. Plugins (official: Slack, Notion, Sheets, Expo, Canva, Remotion) and user skills (folders with SKILL.md files, auto-generatable) enable deep integrations. Automations schedule one-shot workflows. Brown untangles terms: plugins are vetted, skills user-made, MCPs/connectors overlap but extend reach.

GPT 5.5 costs ~2x GPT 5.4 via API (20% over Opus 4.7), with effort sliders (low to extra high). Images 2.0 enhances visuals. Privacy flags on screen-watching, but Brown urges experimentation.

Who Codex Serves and Overcoming AI Overwhelm

Brown targets startup founders juggling docs, landing pages, lead magnets: one interface for all. Companies unlock value by feeding agents "good examples of finished work" to match quality bars via evals. Isenberg probes audience: engineers? Business users? Brown: anyone tired of tool-hopping, especially teams standardizing stacks.

Overwhelm stems from hype and fragmentation—Brown advises sticking to one stack, tinkering rabbit-hole style. Day-one projects: (1) Fun game with browser play; (2) Research-to-spreadsheet/doc/deck pipeline; (3) 3D simulation; (4) Automate annoying task. "Tinker, look dumb, and follow the rabbit holes," he closes.

Isenberg's skepticism softens: browser use feels viable, Remotion pro-level. The pitch lands as super-app convergence collapsing docs/decks/code/research silos.

"Codex models are better at really complex tasks... we've tested this extensively as a team."

– Riley Brown, on GPT 5.5 vs. competitors

"The GUI is better... chats on the left, agent in the middle, output on the right."

– Riley Brown, explaining the dominant agent interface

"If you're using Claude Code inside Cursor... great. Just keep doing that."

– Riley Brown, against tool-hopping

"By the end of the year these browser agents are going to be as fast as humans."

– Riley Brown, on speed breakthroughs

"Have fun first, build a small game and let browser use play it."

– Riley Brown, day-one advice

Key Takeaways

  • Start with Codex via ChatGPT sub; create projects as folders for organized chats and multitasking.
  • Use GUI over terminals for vibe coding, docs, and automations—spawn chats with Cmd+N, track via dots/spinners.
  • Enable plugins like Remotion for motion graphics: pull brand assets, @mention for one-shot videos.
  • Stack models: run claude in Codex terminal to blend GPT 5.5 strengths with Claude Code.
  • Day-one: game + browser play; research-to-deck; 3D sim; automate drudgery—tinker freely.
  • Feed agents finished work examples for quality; evals ensure output matches your bar.
  • Browser/computer use now near-human speed—test chess demo, expect parity by EOY.
  • Skills via SKILL.md folders: ask Codex to generate; schedule automations for recurrence.
  • Ignore splits like Claude Cowork/Code—unified interfaces win for knowledge + code work.

Summarized by x-ai/grok-4.1-fast via openrouter

9015 input / 2539 output tokens in 28133ms

© 2026 Edge