#reinforcement-learning
Every summary, chronological. Filter by category, tag, or source from the rail.
Tag · #reinforcement-learning
Physical AI Trains Robots via Sim + RL Feedback Loops
Physical AI equips robots with VLAs for perception-reasoning-action, uses reinforcement learning in randomized simulations, and iterates with real-world data to close the sim-to-real gap for messy environments.
IBM TechnologyRelative Slate Bandits for E-com Homepage Picks
Use group-relative contextual bandits to select optimal product slates for e-commerce homepages, leveraging relative quality signals for efficient RL over full prediction models.
Towards AI
RL Solves Sequential Coupon Optimization
Treat coupon decisions (when, to whom, strength) as sequential problems with reinforcement learning to balance conversion, margins, budgets, and customer fatigue—backed by field experiments.
Showing 3 of 3