Kilo VS Code: Free Parallel AI Agents & Worktrees

Kilo's rebuilt VS Code extension shares CLI core for faster features, adds parallel tool calls/subagents, Git worktrees for isolation, and free access via Kilo/OpenRouter/NVIDIA models—turning it into a GA AI coding tool.

Shared Core Architecture Delivers Consistent, Fast Updates

Kilo rebuilt its VS Code extension on the same portable core as Kilo CLI, eliminating prior inconsistencies across CLI, JetBrains, and VS Code. This unification speeds feature delivery, improves performance, and simplifies maintenance—VS Code now acts as another frontend, ensuring parity. Result: New capabilities like parallel processing roll out everywhere simultaneously, avoiding the 'special snowflake' delays of the old VS Code-tied version.

Parallelism and Isolation Supercharge Agent Workflows

Parallel tool calls enable simultaneous file reads, searches, and commands, while parallel subagents delegate tasks—one implements features, another tests, a third reviews—boosting throughput over serial chats. Define custom subagents for team workflows. Agent Manager organizes multiple sessions for switching/comparing without tab chaos. Git worktrees isolate attempts in separate branches/workspaces, preventing conflicts during experimentation. Side-by-side comparisons test models/strategies directly; inline code review adds diff comments plus chat summaries, mimicking real dev reviews. Unified agents interface integrates sessions/reviews cohesively, with provider settings, MCP marketplace, session imports, dedicated terminals, and CLI/cloud sync for seamless cross-platform continuity.

Free Model Setup Unlocks Production-Ready Testing

Skip subscriptions: Use Kilo's built-in free-tier models (labeled 'free'). Connect OpenRouter API key for Qwen 3 Coder Free, GLM 4.5 Air Free, DeepSeek-R-10528 Free, Kimmy K2 Free—enable prompt training in OpenRouter settings if needed. For NVIDIA NIMs, use OpenAI-compatible provider: paste NVIDIA API key/base URL/model ID (e.g., Kimmy, GLM, MiniMax) for free developer access (testing terms apply, not infinite production). Pair with free Codestral (Mistral) autocomplete. All integrate into one workflow: select provider/model, leverage agents/worktrees/reviews consistently. April 2, 2026 update fills beta gaps, making it feel GA.

Video description
In this video, I'll be talking about Kilo Code's rebuilt VS Code extension, what is now officially live in the new version, and how you can use it for free through Kilo's built-in models, OpenRouter, NVIDIA, and free autocomplete options. -- Key Takeaways: 🚀 Kilo’s rebuilt VS Code extension now runs on the same portable core as Kilo CLI, making feature delivery faster and more consistent. ⚡ The new live version adds parallel tool calls, parallel subagents, and a much more practical multi-agent workflow. 🗂️ Agent Manager and Git worktree support make it easier to manage multiple sessions and keep different coding attempts isolated. 🔍 Side-by-side comparisons and inline code review make testing models and reviewing code much more useful inside the editor. 🧩 Kilo now offers a more unified Agents experience, along with better provider settings, session importing, terminals, and MCP marketplace support. 💸 You can use Kilo for free through Kilo’s own free models, OpenRouter free-tier models, NVIDIA developer access, and free Codestral autocomplete. 👍 Overall, the April 2, 2026 update makes Kilo feel much more like a real GA product rather than just an early rebuild preview.

Summarized by x-ai/grok-4.1-fast via openrouter

5665 input / 1108 output tokens in 9491ms

© 2026 Edge