#open-source
OpenAI's Patch the Planet Initiative for Open Source Security
OpenAI has launched 'Patch the Planet,' a collaboration with security firm Trail of Bits, to provide open source maintainers with expert security reviews and AI-assisted tooling to identify and remediate vulnerabilities.
OpenSkill: Enabling Self-Evolution in Open-World LLM Agents
OpenSkill is a framework designed to allow LLM agents to autonomously improve their capabilities in open-world environments through iterative self-evolution, bypassing the limitations of static training data.
How to Install the Home Assistant Community Store (HACS)
HACS enables custom integrations and themes in Home Assistant. Installation requires a GitHub account and varies slightly depending on whether you use HAOS/Supervised or Container/Core setups.
Why Cloudflare Acquired the Vite Team
Cloudflare acquired VoidZero, the company behind Vite, to accelerate the development of an agent-first, full-stack deployment experience that simplifies infrastructure provisioning for AI-generated applications.
Maximilian Schwarzmuller | Security Risks of Autonomous AI Agents: The OpenClaw Case
Autonomous AI agents like OpenClaw introduce significant security vulnerabilities by running untrusted code with local system privileges, enabling risks like prompt injection, credential theft, and autonomous lateral movement.
IBM Technology | SIA: Self-Improving Agents That Evolve Scaffold and Weights
Hexo Labs' open-source SIA framework enables AI agents to autonomously improve by iteratively updating both their operational harness (prompts/tools) and internal model weights (via LoRA) within a single feedback loop.
ByteDance's Lance: A Unified 3B Model for Vision and Video
Lance is an open-source, 3B parameter unified model that natively integrates image and video understanding, generation, and editing within a single jointly trained framework.
Osaurus: Mac LLM Server for Local/Cloud Model Switching
Osaurus is an open-source server that runs local and cloud AI models on Macs, switches models on demand, sandboxes execution for security, and needs 64GB+ RAM.
Supertonic v3: 99M-Param On-Device TTS Beats Cloud Rivals
Supertonic v3 runs TTS on-device via ONNX with 31 languages, expressive tags like <laugh>, and flawless handling of $5.2M or 30kph—outperforming ElevenLabs/OpenAI on complex text at 404MB total size and 0.3x RTF on e-readers.
Cline SDK: Open-Source Modular Runtime for AI Agents
Cline's @cline/sdk extracts its agent runtime into a layered TypeScript stack, enabling portable, durable AI coding agents that beat benchmarks like 74.2% on Claude Opus 4.7 vs. Anthropic's 69.4%.
Clawdmeter: Desk Hardware for Claude Token Tracking
An open-source ESP32 device that animates a Clawd sprite based on your Claude Code token usage, displays charts via Bluetooth, and sends keyboard shortcuts. It was built in days with Claude's help.
GLiGuard: 300M Safety Model Beats 90x Larger Rivals
Deploy GLiGuard, a 300M encoder model, for LLM safety moderation: matches accuracy of 23-90x larger models across 9 benchmarks while running 16x faster at 26ms per request.
Agents Train Models via Hugging Face Skills
Hugging Face skills let coding agents fine-tune VLMs like Qwen2VL on datasets like LLaVA Instruct Mix with one prompt: agents calculate VRAM, pick instances, and launch jobs remotely or locally.
AntAngelMed: 103B MoE Medical LLM Matches 40B Dense at 7x Speed
A 103B-parameter open-source medical LLM that activates only 6.1B parameters via a 1/32 MoE, rivals 40B dense models with 7x efficiency, tops HealthBench/MedBench, and runs at 200+ tps on H20.
TwELL Delivers 20% LLM Speedups via GPU-Optimized Sparsity
Use ReLU gate activation + L1=2e-5 on hidden activations to induce 99.5% sparsity in feedforward layers, then TwELL CUDA kernels yield 20.5% inference and 21.9% training speedups on H100s with no accuracy loss.
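As a rough illustration of the recipe in this summary, the sketch below computes ReLU-gated hidden activations and an L1 penalty with the coefficient 2e-5 quoted above. The gated layout (`relu(x @ W_gate) * (x @ W_up)`), the function names, and the toy weights are assumptions made for illustration; this is not the TwELL code and does not involve its CUDA kernels.

```python
def relu(value):
    """ReLU: negative pre-activations become exactly zero."""
    return value if value > 0.0 else 0.0


def gated_hidden(x, w_gate, w_up):
    """Hidden activations of a gated feedforward layer (assumed layout).

    Each row of w_gate / w_up holds the weights of one hidden unit.
    A unit whose gate pre-activation is <= 0 contributes an exact zero,
    which is what makes the activations sparse.
    """
    hidden = []
    for gate_row, up_row in zip(w_gate, w_up):
        gate = relu(sum(xi * wi for xi, wi in zip(x, gate_row)))
        up = sum(xi * wi for xi, wi in zip(x, up_row))
        hidden.append(gate * up)
    return hidden


def l1_penalty(hidden, coeff=2e-5):
    """L1 regularizer on hidden activations; added to the task loss."""
    return coeff * sum(abs(h) for h in hidden)


def sparsity(hidden):
    """Fraction of hidden units that are exactly zero."""
    return sum(1 for h in hidden if h == 0.0) / len(hidden)


if __name__ == "__main__":
    x = [1.0, -1.0]                              # toy input (hypothetical)
    w_gate = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
    w_up = [[2.0, 0.0], [1.0, 1.0], [2.0, 0.0]]
    h = gated_hidden(x, w_gate, w_up)
    print(h, sparsity(h), l1_penalty(h))
```

During training the penalty would be added to the task loss, pushing more gate outputs to zero; a sparse kernel can then skip the zeroed units.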
Replay Logs Fail Agents: Use VM Snapshots Instead
Replay-based durability constrains agent code and its logs keep growing; instead, split state into context logs (durable in a DB) and execution snapshots (14MB Firecracker VMs, <1s save, 100ms restore) to support multi-day sessions.
AI Engineer | Hermes Desktop App Enables Easy Self-Evolving AI Agents
Hermes Agent runs 24/7 persistent, self-improving AI agents locally with long-term memory and closed learning loops; new Desktop App adds intuitive UI for setup, multi-agent management, and tools on Windows, macOS, Linux.
Rust CUDA Kernels via Direct PTX Compilation
cuda-oxide lets you write safe Rust SIMT GPU kernels that compile directly to PTX using a custom rustc backend, skipping C++ or DSLs—host/device in one .rs file, with cargo oxide build producing binary + .ptx.
Build Hermes AI Agent: VPS Setup to Scaled Automations
Follow this step-by-step guide to deploy Hermes Agent on a VPS, integrate Telegram, create skills/crons, backup to GitHub, and scale multiple agents for proactive AI assistance.
Trigger.dev: Async Infra Powers 90% AI Agents
Trigger.dev evolved from Zapier-for-devs background jobs to a reliable SDK for executing AI agents, hitting PMF with v3's hosted execution and checkpoint-resume primitives—perfectly timed for agent era, now 90% usage from agents.
Y Combinator | Symphony: Agents Autonomously Claim and Complete Tasks
OpenAI's Symphony uses issue trackers like Linear to let coding agents claim tasks, spin up isolated workspaces, and only ping humans for reviews—solving the 3-5 session supervision bottleneck. Install by prompting an agent with a 2000+ line spec to build it.
Spec-Kit: Specs-First AI Coding for Reliable Production Code
GitHub's open-source Spec-Kit (90k+ stars) uses Spec-Driven Development to ground AI agents in structured specs, generating testable code that matches intent—fixing 'vibe-coding' failures in prototypes turned production.
Anthropic Open-Sources Wall St Analyst Agents
Anthropic released 10 end-to-end Claude agents mimicking Goldman Sachs analyst roles, with prompts, checklists, 11 licensed data connectors, and 7 vertical bundles—democratizing workflows once locked behind $25k terminals and bank secrecy.
AI Summaries (evaluation playlist) | Zig Rejects Bun's Fork Over LLM Policy and Flawed Speed Hack
Bun's Zig fork uses LLM for 4x faster debug builds via parallel analysis, but Zig rejects it for non-determinism risks and upstream incompatibility; Zig prioritizes careful engineering with LLVM bypass for true 40s-to-0.5s speedups.
TokenSpeed Beats TensorRT-LLM 9-11% on Agentic Coding Inference
TokenSpeed open-source engine optimizes agentic workloads with long contexts (>50K tokens) and multi-turn convos, delivering 9% lower latency and 11% higher throughput than TensorRT-LLM at 70-100 TPS/user on NVIDIA B200.
DeepSeek-TUI: Viral Open-Source Claude Code Rival
DeepSeek-TUI, a Rust-based terminal AI coding agent powered by DeepSeek V4's 1M-token context, hit 10k+ GitHub stars in days as a cheap, customizable alternative to Claude Code, built by a music/law student using AI-assisted coding.
OpenClaw's April Shift: Model-Swappable Agent Runtime
OpenClaw evolved from viral demo to durable agent runtime with task orchestration, mature memory, and channels—enabling workflows that swap models like Claude, Codex, or Gemma 4 to survive provider changes.
IBM Granite Speech 4.1: 3 ASR Models for Accuracy, Features, Speed
IBM's 2B Granite Speech 4.1 suite offers three trade-offs: base leads Open ASR Leaderboard (WER 5.33, RTF 231), Plus adds diarization/timestamps, NAR hits RTF 1820 on H100 via transcript editing.
637MB LLM Runs Offline on Base MacBook Air, Works Surprisingly Well
TinyLlama, a 637MB open-source LLM, runs instantly on a stock MacBook Air via Ollama—no internet, GPU, or API needed—handling Node.js servers and casual chats effectively, lowering the bar for useful local AI.
SIE: Dynamic Inference for Small Models on Shared GPUs
Open-source SIE engine from Superlinked enables hot-swapping small embedding models (e.g., Stella, ColBERT) on one GPU via LRU eviction, cutting costs and solving context rot in agents by preprocessing data.
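The LRU hot-swapping idea mentioned in this summary can be sketched as a small cache that keeps a fixed number of models resident and evicts the least recently used one when a new model is requested. The class name `ModelCache`, the count-based capacity, and the `loader` callback are hypothetical; this is not SIE's API, and a real engine would more likely budget by GPU memory than by model count.

```python
from collections import OrderedDict


class ModelCache:
    """Keeps at most `capacity` models loaded, evicting the least recently used."""

    def __init__(self, capacity, loader):
        if capacity < 1:
            raise ValueError("capacity must be at least 1")
        self.capacity = capacity
        self.loader = loader            # hypothetical callable: name -> loaded model
        self._models = OrderedDict()    # ordered from oldest to most recent use
        self.evicted = []               # names evicted so far, for inspection

    def get(self, name):
        """Return the model for `name`, loading it (and evicting) if needed."""
        if name in self._models:
            self._models.move_to_end(name)   # mark as most recently used
            return self._models[name]
        if len(self._models) >= self.capacity:
            oldest, _ = self._models.popitem(last=False)  # drop the LRU entry
            self.evicted.append(oldest)
        model = self.loader(name)
        self._models[name] = model
        return model

    def resident(self):
        """Names of currently loaded models, least recently used first."""
        return list(self._models)


if __name__ == "__main__":
    cache = ModelCache(capacity=2, loader=lambda name: f"model:{name}")
    for request in ["stella", "colbert", "stella", "other-model"]:
        cache.get(request)
    print(cache.resident(), cache.evicted)
```

In this run the second request for `stella` refreshes its position, so `colbert` is the entry evicted when the third distinct model arrives.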