Optimize Core Files to Minimize Baseline Context

Trim your CLAUDE.md file ruthlessly: a bloated 910-line version consumed 45% of context during project analysis, while a 33-line version dropped that to 41%, saving 4% per interaction. Add a rule like "When context exceeds 50%, suggest new conversations or sub-agents to reduce it," so Claude flags bloat on its own (for instance, at 75% usage) and proposes fixes instead of waiting for you to compact manually.
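A minimal way to bake that rule in, assuming you run this from the project root (the heading name is an arbitrary choice):

```shell
# Append a context-budget rule to the project's CLAUDE.md
# (the 50% threshold mirrors the rule quoted above)
cat >> CLAUDE.md <<'EOF'
## Context hygiene
When context exceeds 50%, suggest new conversations or sub-agents to reduce it.
EOF
```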

Break monolithic workflows into granular skills (e.g., one each for LinkedIn posts, emails, proposals, or CSV analysis). Skills load only the relevant context: analyzing a bank CSV with a dedicated skill used 27% versus 45% when the same questions were dumped into a generic CLAUDE.md. Create reference files for reusables like tone guides or banned phrases, and prompt Claude to "reference if needed, skip otherwise." Baking them in bloated a skill to 457 lines (31% usage); referencing slimmed it to 31 lines (25%).
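A sketch of one such slim skill, following Claude Code's skill layout (SKILL.md with name/description frontmatter); the skill name and the reference-file paths are assumptions:

```shell
# Create a dedicated skill that points at shared style files
# instead of inlining them
mkdir -p .claude/skills/linkedin-post
cat > .claude/skills/linkedin-post/SKILL.md <<'EOF'
---
name: linkedin-post
description: Draft LinkedIn posts in my voice
---
Draft the post from the user's notes. If tone questions come up, reference
references/tone.md and references/banned-phrases.md; skip them otherwise.
EOF
```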

For large files like a 3,001-line transcript, attach them as filesystem references, not chat messages: pasting consumed 71% of context, while referencing dropped it to 38%, nearly halving it. Switch models strategically: Haiku burned 33% on a simple "Hey," while Opus used just 9%, freeing headroom for complex tasks.
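For example, a file-reference prompt might look like this from the terminal (the transcript path and model choice are assumptions; Claude reads the file from disk instead of holding it in the message):

```shell
claude --model opus "Summarize the key decisions in transcripts/q3-call.txt"
```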

Audit and Reset Conversation History

Run /context anytime to break down usage: it lists tokens and percentages by category (e.g., MCP tools, memory, skills), revealing culprits even in basic chats. Hit /clear or open a new tab to reset fully when only 2-3% of capacity remains. For salvageable history, use /compact: it summarizes long threads into a small prompt (specify what to keep, like key decisions), restarting fresh without losing the essence. It's ideal when the thread sits at 90-100% bloat.
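The audit-then-compact flow is two commands inside the Claude Code REPL (these are slash commands, not shell; the compaction instruction wording is an assumption):

```shell
/context   # token/percentage breakdown by category
/compact Keep the key decisions and open action items
```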

Purge Persistent Overhead and Offload Tasks

Query "check all my memories" to list Claude's stored facts (e.g., 17 personal and workflow items); delete irrelevant ones, like demo projects ("delete everything about Hierarchy"), to trim hidden drag. Run claude mcp list, then visit claude.ai/settings/connectors to revoke unused integrations: even three connectors like Slack and Airtable eat substantial tokens, and 20-40 would explode it.
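The audit-and-prune step from the terminal (the server name "slack" is an assumption; substitute whatever claude mcp list reports):

```shell
claude mcp list          # show configured MCP servers
claude mcp remove slack  # drop one you no longer use
```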

For massive tasks (e.g., junk folders full of large or binary files), spawn sub-agents: prompt "use sub-agents to extract questions, action items, and decisions separately." This silos the work: each sub-agent runs at around 33% usage, avoiding overload in the main thread. Claude defaults to this for big projects, but an explicit request guarantees it, distributing context across threads for reliable outputs.
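A reusable version of that prompt can live as a dedicated sub-agent definition, following Claude Code's agent layout (the agent name and wording are assumptions):

```shell
# Define a sub-agent that mines transcripts in its own context window
mkdir -p .claude/agents
cat > .claude/agents/transcript-miner.md <<'EOF'
---
name: transcript-miner
description: Extract questions, action items, and decisions from a transcript
---
Read the referenced file and return three short lists: questions, action
items, and decisions. Keep quotes brief so the main thread stays lean.
EOF
```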