AI News: Spud, Conway Agent, Cursor 3, Gemma 4 Drops

OpenAI's Spud (GPT-6?) eyes spring 2026 with superior reasoning; Anthropic's Conway enables always-on browser automation; Cursor 3 runs multi-agents across envs; Qwen 3.6+ hits 1M tokens, Gemma 4 runs on iPhone at 40k tok/s.

Frontier Closed Models Push Reasoning and Multimodality

OpenAI's Spud, internally labeled GPT-5.5 or potentially GPT-6, prioritizes raw intelligence over projects like Sora, targeting spring 2026 release. Greg Brockman describes it as having 'big model smell'—more intuitive adaptation to user intent, complex long-term reasoning, and flexible handling of tasks beyond fine-tunes. A trusted source notes improvements over GPT-5.4 but not matching top tiers like Anthropic's models yet. Separately, OpenAI's GPT-Image-2 checkpoint on Arena (under codenames masking tape alpha, gaffer tape alpha, packing tape alpha) excels in world knowledge, near-perfect text rendering, and replicating specifics like doctor notes or company logos—testable now in Arena's battle mode.

Anthropic's Conway is an always-on agent running in its own UI instance, automating browsers via connectors and Claude Code, with webhook triggers and extensibility via upcoming CNW ZIP for custom tools, UI tabs, and context handlers. Claude Code's new /ultraplan (via command, prompt, or web refine) shifts detailed planning to browser-based cloud execution for better design alignment before local implementation, browser reviews for readability, and flexible remote/local execution—currently research preview. Anthropic also integrates Deepgram Nova-3 for voice, expanding Claude to multimodal speech understanding/generation, likely in next releases like Mythos or Sonnet 5.

Coding IDEs and Agent Workflows Evolve

Cursor 3 redesigns for agent-heavy coding, running multiple agents locally, via SSH, or cloud, with a separate window surfacing editor features contextually to complement full IDEs. Anthropic Pro/Max plans ($20-200/month) end third-party tool coverage (e.g., OpenClaw) from April 4, requiring extra billing; users get one-time credits equal to subscription value and pre-purchase discounts, ending arbitrage where $200 plans ran thousands in workloads.

Open Models Excel in Context, Speed, and Benchmarks

Alibaba's Qwen 3.6-Plus delivers 1M token context, 78.8 on SWEBench (vs. Claude 3 Opus at 80.9), outperforming Opus on most benchmarks with stronger coding, cheaper pricing, image/screen understanding like a real user, and reliability in real-world tasks. Google's Gemma 4 family (Apache 2.0), built on Gemini 3 research, supports multimodal inputs (text, images, audio, video), long context, reasoning/coding; ranks #3 on Arena. The 2B variant runs on iPhone 17 Pro at ~40k tokens/sec via MLX optimization, enabling on-device multimodal AI. DeepSeek V4 launches in weeks, first frontier Chinese model native on Huawei Ascend chips—Alibaba, ByteDance, Tencent ordering thousands, with prices up 20%; signals China's reduced NVIDIA dependency using domestic compute stacks now viable at scale.

These updates highlight accelerating agent automation, on-device feasibility, and hardware diversification, with specific benchmarks and access points for immediate testing.

Video description
This week in AI is absolutely massive! From OpenAI’s Spud (GPT 6) to Anthropic’s Conway agent, Claude Code Ultra, and cutting-edge releases like Qwen 3.6-Plus, Gemma 4, and Cursor 3, the AI landscape is moving faster than ever. 🔗 My Links: Sponsor a Video or Do a Demo of Your Product, Contact me: intheworldzofai@gmail.com 🔥 Become a Patron (Private Discord): https://patreon.com/WorldofAi 🧠 Follow me on Twitter: https://twitter.com/intheworldofai 🚨 Subscribe To The SECOND Channel: https://www.youtube.com/@UCYwLV1gDwzGbg7jXQ52bVnQ 👩🏻‍🏫 Learn to code with Scrimba – from fullstack to AI https://scrimba.com/?via=worldofai (20% OFF) 🚨 Subscribe To The FREE AI Newsletter For Regular AI Updates: https://intheworldofai.com/ 👾 Join the World of AI Discord! : https://discord.gg/NPf8FCn4cD Something coming soon :) https://www.skool.com/worldofai-automation [Must Watch]: Claude Code Computer Use Can Control Your ENTIRE Computer! Automate Your Life!: https://youtu.be/KiywNP4b0aw?si=HuJnvik0AgLjIkCb Turn Antigravity Into AN AI Autonomous Engineering Team! Automate Your Code with Subagents!: https://www.youtube.com/watch?v=yuaBPLNdNSU Gemini 3.5? NEW Gemini Stealth Model Is POWERFUL & Fast! (Fully Tested): https://youtu.be/1abLcL33eKA?si=H50xRhJxVYM7HFPK 📌 LINKS & RESOURCES https://x.com/chatgpt21/status/2039447583936901340 https://x.com/testingcatalog/status/2039490365414048182 https://x.com/bcherny/status/2040206440556826908 https://x.com/oikon48/status/2040442838127944009/photo/2 https://x.com/himanshustwts/status/2040383160249381044/photo/2 https://code.claude.com/docs/en/ultraplan https://x.com/adrgrondin/status/2040512861953270226 https://x.com/cursor_ai/status/2039768512894505086 https://arena.ai/ Here’s what’s happening: OpenAI Spud (GPT 6/GPT-5.5): New base model with “big model smell,” more intuitive, capable of complex reasoning, and longer time horizons. GPT-Image-2: OpenAI’s powerful new image model with insane world knowledge and text rendering. Anthropic Conway: Always-on agent capable of browser automation, connectors, Claude Code, and extensible with CNW ZIP. Claude Code Ultra: New Ultraplan feature allows detailed planning in the browser with flexible execution locally or on the web. Anthropic Voice Models: Claude now moving into voice with Deepgram Nova 3, expanding into multimodality. Cursor 3: Run multiple coding agents anywhere—locally, SSH, cloud—with a flexible interface that complements IDEs. DeepSeek V4: Launching soon and running natively on Huawei chips, marking a major step in China’s move away from NVIDIA. Qwen 3.6-Plus: 1M token context window, stronger coding, multimodal understanding, and more reliable in real-world tasks. Gemma 4: Open-source model built on Gemini 3 research, image understanding, reasoning, and running on-device on iPhone 17 Pro at ~40k tokens/sec with MLX optimization. The AI race is accelerating, and the future of agents, multimodal AI, and high-performance models is happening right now. [Time Stamp]: 0:00 - Introduction 1:10 - OpenAI GPT-6 (Spud) 2:49 - GPT Image 2 3:49 - Anthropic Conway 4:26 - Anthropic Pro/Max Plan Changes 4:49 - Claim Anthropic Credits 5:41 - Claude Code /ultraplan 6:38 - Anthropic Voice Model 7:19 - Cursor 3 8:00 - Hauwei Model Training 9:19 - Deepseek v4 Release Date 9:24 - Qwen 3.6 Plus 10:12 - Gemma 4 10:58 - Gemma 4 2B Running On The Phone Stay tuned for more updates! Tags/Keywords (comma-separated): AI news, OpenAI Spud, GPT-6, GPT-5.5, GPT-Image-2, Anthropic Conway, Claude Code Ultra, Claude Ultraplan, Claude Voice, Deepgram Nova 3, Cursor 3, DeepSeek V4, Huawei AI chips, Qwen 3.6-Plus, Gemma 4, Gemini 3, iPhone AI, multimodal AI, AI agents, AI coding, AI models 2026, AI breakthroughs, AI updates Hashtags: #AI #OpenAI #Spud #ClaudeConway #ClaudeCodeUltra #Cursor3 #DeepSeekV4 #Qwen36Plus #Gemma4 #AInews #MultimodalAI #AITechnology #FutureOfAI

Summarized by x-ai/grok-4.1-fast via openrouter

7363 input / 1785 output tokens in 15994ms

© 2026 Edge