№ 02 / SUMMARIES

#on-device-ai

Every summary, chronological.

DAY 01 · May 22, 2026 · 1 summary
AI Engineer · AI & LLMs

Building AI-Powered Android Apps with Gemini Nano

Android developers can run Gemini Nano on-device through the AICore system service, or use hybrid inference to fall back to cloud models when needed. The on-device path keeps data local, and the system handles model deployment and resource management.

DAY 02 · May 20, 2026 · 1 summary
AI Engineer · AI & LLMs

Fine-Tuning Tiny LLMs for On-Device AI Agents

Developers can achieve production-grade on-device performance in one of two ways: use a system-level model (Gemini Nano) for general tasks, or fine-tune a tiny LLM (<1B parameters) via LiteRT-LM for specialized, high-accuracy agentic workflows.

DAY 03 · May 11, 2026 · 1 summary
AI Engineer

MLX: Frontier AI Fully On-Device on Apple Silicon

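MLX runs real-time vision, sub-100 ms TTS, omni models, 426B-parameter LLMs, and text-to-video on a Mac with 16 GB of unified memory, with no cloud dependency. Turbo Quant cuts the KV cache by 4x to support 1M-token contexts. Running fully on-device makes accessibility applications and robots practical in low-connectivity areas.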

